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tion to isolate additional NORK genes and proteins from different legumes or other plants, and their use for producing transgenic 
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steps of symbiotic nitrogen fixation, as well as to methods of producing transgenic plant cells and cloned entire plants wherein the 
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Identification and cloning of the receptor gene for symbiotic 

nitrogen fixation 

TECHNICAL FIELD 

5 

The invention relates to the Medicago sativa DNA sequence encoding the 
NORK polypeptide as well as to the protein determined by this DNA sequence. In 
another aspect, the invention relates to conserved DNA regions, primers, DNA 
probes, polypeptides and corresponding antibodies characteristic to the NORK 
iO gene family, as well as to methods and their utilization to isolate additional NORK 
genes and proteins from different legumes or other plants, and their use for 
producing transgenic plants. 

In another aspect, the invention relates to a method of cloning a DNA 
15 sequence encoding a function involved in one of the steps of symbiotic nitrogen 
fixation, as well as to methods of producing transgenic plant cells and cloned entire 
plants wherein the biosynthesis of the NORK polypeptide takes place. 

The invention also relates to the identification of a gene present in 
20 leguminous plants, encoding a function indispensable for symbiotic nitrogen 
fixation, to the sequencing of the gene, confirmation of the biological function of 
the gene, and the transformation of the gene into plants unable for symbiotic 
nitrogen fixation. 

25 BACKGROUND ART 

It is well known that symbiotic nitrogen fixation covers the requirements of 
the nitrogen demands of a plant and ensures normal growth, development and 
seed production in soils with limited amount of fixed nitrogen without added 
30 artificial nitrogen fertilizers (1). Plants capable of symbiotic fixation differ from other 
plants in that they carry all of the genetic determinants in their genome which are 
needed for the biological process of symbiotic nitrogen fixation. For those experts 
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who are familiar with the recombinant DNA techniques it is obvious that these 
genes can be identified, isolated and transferred to other plants on such a way that 
they retain their function. In theory, all genes required for symbiotic nitrogen 
fixation can be transferred to recipient plants which eventually will be able to 
5 establish symbiotic nitrogen fixation, consequently, they can be grown on a more 
economical and environmental friendly manner. 

The identification of genes involved in symbiotic nitrogen fixation are being 
studied very intensively worldwide, still the nucleotide and amino acid sequences 
10 which are presented in this patent application are completely novel, we had access 
only to randomly sequenced cDNA sequences. 

The symbiotic nitrogen fixation is a biological process established between 
particular dicotyledonous plants and soil bacteria (2, 3). The symbiotic nitrogen 

15 fixation takes place mostly in the root system where root nodules develop, and are 
invaded by the bacteria. During the development of root nodules bacteria enter the 
plant cells and change to bacteroids, and in such form they fix nitrogen that is they 
convert the inert dinitrogen of the air into ammonia. This ammonia (which is a form 
of the so-called fixed nitrogen) can be utilized by the plants for their development. 

20 Since no artificial nitrogen fertilization is needed, symbiotic nitrogen fixation is an 
environmental friendly, cost effective and improve the quality of the soil, therefore, 
these plants are very suitable as green crops. Agricultural legumes, which are 
capable to fix the nitrogen from the air, are therefore very important part of the 
agriculture. Their seeds and green mass contribute to large extent to the human 

25 and animal amino acid and oil supply, therefore, legumes are extremely important 
in the worldwide food supply. 

Some soil bacteria, like Rhfzobia and Actinomycetes, are able to establish 
symbiotic nitrogen fixation relationship with given plants, like many legumes, 
30 Trema, Casuarina, Alnus, Parasponia (2, 3). 
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As a consequence of the evolution this symbiotic relationship is strictly 
species specific, a given group of bacteria establishes symbiosis only with a given 
group of plants (2, 3, 4, 5), Such symbiosis is established between Sinorhizobium 
meliloti and Medicago, Melilotus, Trigonella species (6), between Rhizobium 
5 leguminosarum bv. viciae and Vicia species, between Rhizobium leguminosarum 
bv. trifolii and given members of Trifolium species, between Rhizobium etli and 
Phaseolus species, between Bradyrhizobium japonicum and Glycine max, 
between Azorhizobium caulinodans and Sesbania rostrata, between 
Mesorhizobium loti and Lotus japonicus (7). 

10 

The biogenesis and development of the symbiotic nodule, as well as the 
invasion of the plants cells by the bacteria is a complex, multi-step process taking 
place through series of gene-interaction involving both bacterial and plant 
symbiotic genes. The formation of the nodule, which is inevitable for the symbiotic 

15 nitrogen fixation according to our present knowledge, starts to develop after a 
specific signal exchange between the microsymbiont (the bacteria) and the 
macrosymbiont (the plants). Through the root system, the macrosymbionts excrete 
specific flavonoid and other molecules into the environment, which are 
characteristic to and determined by the genetic material of the plant species (8, 9). 

20 As a response to these excreted molecules compatible bacteria approach by 
chemotaxis the plant root system (10) and start to produce a signal molecule, the 
so-called lipo-chitooligosaccharides abbreviated as Nod factors, which in turn are 
specific nodulation signal for the plant (11, 12, 13). During nodule formation 
compatible bacteria attach to the root-hair surface on a polar manner. In this 

25 joining special molecules like bacterial pili, plant lectins and other adhesive 
molecules are active components (14). Upon the action of the symbiotic specific 
Nod factors, characteristic plant responses occur like root-hair deformation, curling 
and branching (15). Inside the root tissue, the cortical cells start to divide. As a 
response of the specific Nod factor produced by the compatible bacteria a 

30 symbiotic nodule start to develop on the root or on the stem depending on the 
type of symbiosis. Some of the specific Nod factors are shown in Fig, 1 . without 
the sake of completeness (5, 7). As a consequence of the root-hair curling 
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provoked by the specific Nod factor, the bacteria become surrounded and enter 
the so-called infection threads produced by the plant. The infection threads start to 
grow towards the core tissue of the plant, where they start to branch. Inside the 
infection thread, the bacteria divide and move along the infection threads. At the 
5 same time the cortical cells continue to divide and the nodule meristeme is formed. 
The branching infection threads enter the dividing cells and the bacteria are 
released inside the plants cells, where they continue to divide and in a short time 
they occupy most of the plant cytoplasm establishing the so-called infected cells. 
In the Infected cells bacteria are surrounded by plant plasma membrane 

10 (peribacteroid membrane) in which the bacteria change to bacteroids, in which 
form they are capable to fix nitrogen (16, 17). The derepression of the nitrogenase 
enzyme complex takes place only in the bacteriods. The bacteroids, with the help 
of their nitrogenase enzyme complex, can reduce the inert dinitrogen into 
ammonia, the so-called combined nitrogen, which can be utilized by the 

15 macrosymbiont. Most of the ammonia formed during nitrogen fixation (that part 
which is not utilized by the bacteroids) is excreted by the bacteroids. The excreted 
ammonia is taken up by the nodule cells and transferred towards the vascular 
tissue by a proper conducting mechanism. The fully developed and active nodules 
are able to supply the plants with enough combined nitrogen which is needed for 

20 their normal growth and development. 

Root-hair deformation and cortical cell division can be evoked by very low 
(10" 8 -10" 12 M) concentration of the compatible Nod factor even if the bacteria are 
not present, if the Nod factor is applied to the root. Following Nod factor treatment 

25 normal nodule structures are formed on the plant roots, in which no bacteria are 
present. From this phenomenon it is obvious, that the Nod factors are recognized 
by the root cells, which in turn start the specific signal transduction pathway 
leading to nodule formation. The recognition of the excreted Nod factor on the 
surface of the root cells is taking place by an unknown mechanism. Nod factor 

30 binding receptors were looked for by biochemical methods, but no such a 
supposed protein responsible for the binding and transmission of the Nod factor 
could have been identified so far (18). However, it is generally accepted, that the 
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bacterial Nod factor is recognized and bound by a specific receptor and through 
this mechanism the signal transduction cascade evokes nodule induction and 
development. 

5 The steps leading to nodule development, the functions involved, and the 

genes coding for these functions in the plants are not known at present, however, 
one can understand that the presence of the microsymbiont for the formation of 
the nodule structure is not necessary. Some plants like alfalfa {Medicago sativa 
L.) is able to form spontaneous nodules. Under sterile conditions and in the 

10 absence of microbes and higher concentration of combined nitrogen (more than 1 
mM) alfalfa develops some nodule-like structure, which are very similar to the 
morphology and tissue characteristics of the normal nodules (19). As a 
consequence, all those genes and functions, which are necessary for nodule 
development are encoded by the plant genetic material. However, the Nod factors 

15 produced by the compatible microosymbiont are responsible to what extend, and 
what type of nodules are formed by the plant. 

Several genes, both bacterial and plant origin, are involved in the 
development and effective functioning of the nodules. Most of the genes in S. 

20 Meliloti and in other Rhizobia are knnown (5, 20, and personal information from 
Michael Gottfert, Technische Universitat Dresden, Germany). According to Prof. 
Alfred Puhler, University of Bielelfeld, Germany the nucleotide sequence of the 
entire S. meliloti geome is known. However, the identification and the function of 
the plant genes involved in this biological process is still in its infancy. At present, 

25 biochemical and combined genetic and biochemical approaches are applied to 
study plant genes. Genes identified by biochemical methods are questionable as 
to whether they are involved in symbiotic nitrogen fixation. The nodule specific 
expression of many genes isolated by biochemical approaches was demonstrated 
nevertheless their real function was not uncovered (21, 22, 23), or it turned out 

30 that they are not essential to effective symbiotic nitrogen fixation (24). 
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Among others the Medicago EST programs will be useful to get acquainted 
with the nodule specific genes. These programs generated until now more than 
100 000 partial EST sequences. These sequences are open for public on the 
Internet ( http://www.ncbLnlm.nih.gov). Among these sequences there are three 
5 partial cDNA sequences homologous to the NORK gene of M. truncatula. Form the 
mere sequence, however, it is impossible to predict the function of the gene, 
therefore more analysis is needed to reveal the function. The most powerful tool 
for the study of gene functions if mutant alleles of the genes are available. One 
can generate a mutant allele of a gene, or one can isolate a gene from a mutant 

10 allele. The first possibility (homologue recombination) is not available for plants. 
The later, however, is feasible through map based cloning or insertion 
mutagenesis (tranposons, T-DNA) based gene isolation procedures. According to 
mutant phenotype (T-DNA insertion) only one gene has been isolated form Lotus 
iaoonicus: this gene (nin) encoded a transcription factor (25). In this patent we 

15 describe for the first time the isolation of a gene by map based cloning that is the 
isolation of the NORK gene with the help of the Medicago sativa mutant MN1 008. 

Classical genetic strategy leads to unambiguous results. The isolated gene 
is an essential component of the biological process in question if the gene affected 

20 by the mutation has a phenotype (unable to carry out certain functions or less 
active in a given function) and the gene isolated based on the phenotype. In the 
mutant individuals an essential function affected which is involved in the biological 
process studied. Appropriate methods are available by which the mutant gene ant 
ist wild type counterpart can be isolated. In order to acquire mutant individuals 

25 mutations can be induced by physical, chemical or biological means. Mutant 
genes, in which no insertions of known sequences are present, or the nature of 
the mutation is not known (like in the case of large deletions, inversions) can be 
isolated by the so-called map based cloning. The principal of the map based 
cloning is the following: first the phenotype is mapped genetically with molecular 

30 (DNA) markers, and then with the help of these markers chromosomal walking is 
performed and the gene is isolated form clones originated form large insert 
libraries (26). 
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Mutant plants impaired in symbiotic nitrogen fixation can be grouped into 
two categories: mutants in the first class are the Nod" mutants that is no nodules 
appear on their root system; mutants in the second class are the Fix" mutants 
5 which develop nodules or nodule like structures but the nodules are ineffective, 
that is no effective Oirogen fixation takes place. It is a characteristic phenomenon 
of both mutant types, that in the presence of the compatible bacteria and in the 
absence of combined nitrogen the symptoms of nitrogen deficiency develop on the 
mutants, that is yellow leaves and stunted growth. There are many symbiotic 
10 mutants available published in the literature. Among the Nod" mutants the most 
important ones are the Medicago sativa MN1008 mutant (27, 28), the Medicago 
truncatula B129, Tr25, Trv25, B85 mutants (29), the Pisum sativum sym8, sym10, 
syn?19 mutants (30), the Lotus japonicus sym2 mutant (31), and the Glycine max 
nod49 and noc/139 mutants (14). 

15 

The MnNC-1008 (NN) alfalfa (Medicago sativa) mutant (abbreviated as 
MN1008) is incapable to form nodules on its root system in the presence of its 
compatible microsymbiont (Sinorhizobium meliloti) (27, 28). Further studies 
showed that there was no sign of any response detectable by microscopy, which 

20 was characteristic to the wild type plant early nodulation response. There was no 
root hair deformation and cortical cell division (32), in addition to the lack of the 
specific calcium spiking response (special calcium oscillation inside the cells) (33). 
According to unpublished results, there was no root hair curling or cortical cell 
division observed in the mutant MN1008 (unlike in the wild type plant) upon 

25 addition the specific S. meliloti Nod factor. Mutant MN1008, on the other hand, was 
able to develop nodules, the so-called spontaneous nodules, in the absence of 
microbes (see above). This phenomenon reflects that the plant nodulation program 
was intact. From the above results the most plausible conclusion was that the 
alfalfa mutant MN-1008 had a defect in the perception of the Nod factor, that is the 

30 assumed Nod factor receptor gene suffered mutation, or one of the member in the 
early signal transduction chain. 
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DISCLOSURE OF INVENTION 

In agreement with the patent the knowledge and use of the DNA sequence, 
the described technology, specifically the isolation of the Nod factor receptor gene, 
5 or individual genes in the signal transduction has a great importance in the 
agriculture. The transfer of these genes into plants which are unable to fix nitrogen 
but have great agricultural importance, plants like rice, corn, wheat, barley, rye and 
so on, may ensure the possibility, that in the presence of appropriate Nod factors 
produced by Rhizobium bacteria the transgenic plants carrying and expressing 
10 Nod factor receptor genes and genes of the member of the signal transduction 
chain might be able to perform the first biological steps leading to nodule 
formation. Cloning and transfer other nodule specific genes might lead to the 
formation of normal symbiotic nodule and nitrogen fixation. 

15 In accordance with the above, the invention relates to the Medicago sativa 

DNA sequence encoding the NORK polypeptide as well as to the protein 
determined by the mentioned DNA sequence. In addition the invention features 
conserved DNA regions, primers, DNA probes, polypeptides and corresponding 
antibodies characteristic to the NORK gene family, as well as to methods and their 

20 utilization to isolate additional NORK genes from different legumes or other plants, 
and to use those genes for producing transgenic plants. 

In another aspect, the invention features a method of cloning a DNA 
sequence encoding a function involved in one of the steps of symbiotic nitrogen 
25 fixation, as well as to methods of producing transgenic plant cells and cloned entire 
plants wherein the biosynthesis of the NORK polypeptide expresses. 

The experiments described in this invention relates to the identification and 
isolation of genetic determinants (genes) involved in the determination and 
30 development of symbiotic nodules of alfalfa {Medicago sativa) and in the 
perception and/or signal transduction of the Nod factor originating from the 
bacteria, with the help of the MN-1008 mutant incapable for Nod factor triggered 
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nodule development, on such a way, that the mutant phenotype is genetically 
mapped with the use of molecular markers, followed by the isolation of overlapping 
BAG clones using the tightly linked markers and by recombinant DNA techniques, 
then sequencing the genes in the overlapping BAG clones, and finally sequencing 
5 the mutant alleles responsible for the mutant phenotype. Applying the above 
procedure the following nucleotide sequence was identified: 

CCATATTTTAACAATATTCTTTCTTCTACAAGGGTATAACTTTTATACAAGTTCACTATA 
TTATAGGATTGATCAAGGTTCATTTTTTCTTTCTTTGAAAAATCTCTAAGGGGTGTGGTT 

10 TCCAAGGCAGAAAATGAAATAGAATGCAGAAGAATTTGTATGGTACTATAAAGGGAAGAT 
GAAAAGTTAGTTAGCATGGATTCAAGTTTGATAACCCTTTGGGGTAAAATCTCTTTCAGA 
TT ATGA TGGAGCTACAAGTTATTAGGATATTTAGATTGGTTGTGGCATGTGTTCTTTGTT 
TGTGTATATTTATCAGATCAGCTTCTTCTGCAACTAAAGGGTTTGAGAGCATATCATGTT. 
GTGCTGATTCCAATTACACAGATCCAAAAACAACCCTAACTTATACAACAGATCACATCT 

1 5 GGTTCTCTGATAAAAGAAGTTGCAGACCAATACCCGAAATTTTGTTTAGCCACAGAAGCA 
ATAAAAATGTTCGAATATTTGAAATAGATGAAGGAAAGAGATGTTATACTTTGCCAACAA 
TTAAGGATCAAGTATATTTGATAAGGGGTGTATTTCCCTTTGATAGTTTAAATTCTTCGT 
TTTATGTTTATATCGGGGTAACAGAACTAGGTGAATTAAGATCGTCTAGACTCGAGGACT 
TGGAAATCGAGGGAGTTTTTAGAGCCACCAAAGACTATATTGATTTCTGCTTATTGAAGG 

20 AAGATGTCAATCCCTTCATTTCTCAGATTGAATTGAGGCCATTACCTGAAGAATACCTAC 
ATGGTTTCGCTACTAGTGTTTTl^AAACTGATAAGCAGAAATAATCTTGGTGACACAAATG 
ATGATATAAGGTTCCCAGATGACCAAAATGATAGAATCTGGAAACGGAAAGCAACTTCAA 
CTCCATCATCTGCCCTTCCCCTGTCTTCCAATGTCAGCAATGTTGACCTCAAAGACAGTG 
TCACACCTCCTCTACAAGTCCTACAAACAGCTCTTACTCACCCTGAGCGATTGGAGTTCG 

25 TCCATGATGGCCTCGAGACCGATGATTATGAATACTCTGTGTTTCTCCACTTTCTTGAAC 
TAAATGGCACTGTCAGAGCAGGACAAAGGGTGTTTGACATCTATCTAAACAATGAGATTA 
AAAAGGAGAAGTTTGATGTTTTGGCTGGAGGGTCCAAGAACAGTTACACTGCCTTGAACA 
TTTCAGCAAATGGATCACTCAATATAACCTTAGTCAAGGCATCTGGATCTGAGTTTGGAC 
CCCTTTTGAATGCCTATGAAATCCTGCAGGCACGGTCGTGGATTGAAGAGACCAACCAAA 

30 AAGATTTGGAACTTATTCAGAAGATGAGAGAAGAACTGCTGCTGCACAACCGAGAAAATG 
AAGCATTGGAGAGTTGGAGTGGAGACCCTTGTATGATTTTCCCCTGGAAAGGAATAACAT 
GTGATGATTCAACTGGTTCATCTATTATCACTATGCTGGATCTTTCTTCCAATAATCTCA 
AGGGAGCAATTCCTTACTTTGTCACTAAGATGACCAATTTACAAATACTGAACCTGAGCC 
ACAACCAGTTCGATTCGTTATTCCCCTCGTTTCCACCGTCCTCCTTGCTGATATCATTGG 

35 ATCTGAGCTACAATGATCTTGATGGACGGCTTCCAGAATCCATTATCTCACTGCCACATT 
TAAAATCATTATATTTTGGCTGCAATCCATATATGAAGGACGAAGATACAACAAAGTTGA 
ACAGTTCACTAATCAATACAGATTATGGGAGATGCAAAGGAAAAAAACCAAAGTTTGGAC 
AAGTATTCGTGATTGGAGCTATTACAAGGGGATCACTTTTGATTACTTTGGCTGTTGGAA 
TTCTATTTTXTTGCCGTTATAGACACAAGTCAATTACTTTGGAAGGATTTGGTGGAAAGA 
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CCTACCCAATGGGAACAAATATAATCTTCTCTTTGCCAAGCAAAGACGATTTCTTCATAA 
AGTCTGTATCAGTTAAACCATTCACTTTGGAGTATATAGAGCAGGCTACAGAACAGTACA 
AAACTTTGATAGGTGAAGGAGGATTTGGTTCTGTTTACAGAGGCACTCTAGACGATGGTC 
AAGAAGTGGCAGTGAAAGTGCGGTCATCCACATCAACTCAGGGAACCCNAG2^ATTTGATA 
5 ATGAGCTAAACCTACTTTCAGCTATACAACATGAGAACCTGGTGCCTCTTCTGGGTTACT 
GTAATGAGTATGATCAACAAATTCTCGTGTATCCATTCATGTCCAATGGCTCTTTGCTAG 
ATAGACTATACGGGGAAGCATCAAAGAGAAAAATATTAGACTGGCCAACTAGACTCTCTA 
TTGCTCTCGGTGCAGCTCGAGGTTTGGCATATCTTCACACATTTCCAGGACGTTCTGTAA 
TACACAGGGACGTAAAATCGAGCAATATACTGCTGGATCAGAGCATGTGTGCTAAGGTTG 

10 CAGATTTTGGTTTCTCAAAATACGCTCCTCAGGAAGGAGACAGTTATGTTTCCCTTGAAG 
TAAGAGGAACTGCAGGGTATCTGGATCCTGAGTACTACAAAACCCAGCAATTATCTGAAA 
AAAGTGATGTTTTCAGCTTTGGTGTGGTTCTACTTGAAATTGTAAGTGGACGGGAACCTC 
TCAACATAAAGAGACCACGGATCGAGTGGAGCTTGGTTGAATGGGCTAAACCATACATAA 
GAGCATCAAAGGTGGATGAAATTGTAGATCCTGGCATCAAGGGAGGATATCATGCAGAAG 

1 5 CATTGTGGAGAGTTGTGGAAGTAGCACTGCAATGTCTAGAACCCTACTCAACATATAGGC 
CATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTCATTATTGAAAACAATGCAT 
CTGAATACATGAAATCCATAGACAGCCTTGGAGGATCCAACCGCTACTCAATTGTTATGG 
ACAAACGGGCGCTGCCTTCAACTACATCTACAGCAGAATCAACTATCACAACCCAAACCT 
TGACACACCCTCAACCGAGATAGTAAATGGGTCGATGGAATTCTTTTGATTTGTTTTTTA 

20 TCATTGCTTTAGTAATATCCCATTTTAAATGGTAAAGGAGAAAAATACTACTTTTGATTG 
TATTTTCATCCACTCTATGTTTCTTGAAACTGAATCTCTCTTGCTCAGCCCCAGTTTTTA 
TGGGTGAAAAGAATAATTTGGGTCAAATGCAAGTGAAACCATATGGTGCATAATTTGAAA 
GCCATATTATATCATTTGCTAAGTCCAAAGTAAAAATTTCACAAACTAGTTAGATTGCGA 
TTTAGTCTATACACACTTCAACAGAGCTATATACACTAT 

25 

The invention relates to the above genomic or synthetic nucleotide 
sequence molecules, to the amino acid sequence deduced from this DNA, to its 
deletion or mutant derivatives, to its recombinant forms, and to any other proteins 
having NORK specific biological activity. The invention relates to those 
30 oligonucleotides which are hybridizing to the above mentioned nucleotide 
sequence. In another aspect, the invention features those antibodies suitable for 
the detection of NORK polypeptides. In addition the invention relates to cells and 
transgenic plants as well transformed with DNA molecules protected by this 
invention. 



35 



The invention presents a procedure by which certain plants with Nod" 
phenotype can be converted to Nod + phenotype. 
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To achieve the aim of the invention the MN-1008 mutant plant was grown. 
As opposed to the wild type plant, mutant MN-1008 respond (does not start 
nodulation consequently does not form nodules) neither to the compatible 
5 Sinorhizobium meliloti bacterium, nor to the Nod factor produced by the bacteria. It 
is supposed that the mutation in MN-1008 plant affected one of the genes involved 
in the signal perception and/or one of the early steps in the signal transduction 
pathway, it is likely the mutation affected the Nod factor receptor genes. 

10 Under appropriate conditions, e.g. in the presence of sufficient nitrogen 

supply or in soil mutant MN1008 can grow and start to .flourish. Mutant MN1008 
was crossed with a nodulating wild type plant, the F1 generation and then by self- 
mating the F2 populations were established. In the F2 population (mapping or 
segregating population) the traits are segregating. From the individuals of the F2 

15 population Nod" and Nod + individuals were colleted (see Example 1.) and linked 
genetic markers were looked for (see Example 2.). With the help of the diploid 
mapping population the linked markers were mapped and it was concluded that all 
these markers mapped in the same region (see Example 3.). According to the 
map position of the diploid map, linked markers were selected from this region, 

20 which were mapped in the Nod" segregating population (see Example 4.). It turned 
out that these markers were linked to the Nod" phenotype, therefore mapped in the 
region where the Nod" mutation mapped to, and two of these were tightly linked 
(see Example 4.). 

25 With the help of the tightly linked markers so-called BAC clones were 

isolated from the BAC library of M. truncatula, followed by the ordering of the BAC 
clones into so-called contigs (see Example 5.). The so-called Nod contig was 
about a 500 kilobase-pairs (kb) long DNA region consisting of overlapping 
individual BAC clones. By the analysis of about 5000 F2 plants it turned out that in 

30 two individuals the recombination breakpoint situated within the Nod contig flanking 
both sides. This region was about 400 kb containing the Nod" mutation (see 
Example 7.). 
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The BAC clones constituting the 400 kb contig were sub-cloned (see 
Example 6.), and the nucleotide sequence of the Nod region was determined, and 
the genes in this region were identified (see Example 8.). The Nod~ mutant 
5 phenotype could be explained by mutation in some of the genes. Without limitation 
only two genes are mentioned as an example. One of these genes is an ABC 
transporter gene. Mutation in ABC transporter genes are responsible for the 
multidrog resistance phenotype (34) and for the human cystic fibrosis disease (35), 
therefore it is possible that the Nod factor uptake is mediated by ABC transporter 

10 proteins. The other gene is coding for a receptor kinase which" was named as the 
NORK gene. The receptor kinase genes in plants are responsible for the 
perception and transduction of signal molecules (36, 37, 38). The lack of one of 
these functions could lead to Nod" phenotype. However, there are some more 
genes in the Nod region (see Fig. 7.) of which some are listed like the ODF gene, 

15 the MADS box gene, and the lectin genes, which could be responsible for the Nod" 
phenotype if they carry mutation alone or in any combination. 

After the sequencing of the wild type allele of these genes, the mutant 
alleles were also sequenced on such a way, that gene specific primer pairs were 

20 designed, synthesized and used for the amplification of the appropriate DNA 
region of the MN-1008 mutant plant. No nucleotide changes were found in the 
ABC transporter, ODF, MADS box and lectin genes which could be responsible for 
the mutant phenotype. Some allelic variations were found, but these mutations 
could be neutral changes. On the other hand, however, an in frame stop codon 

25 mutation was found in the coding region of the NORK gene originated from the 
MN-1008 mutant (see Fig. 9) which terminated the protein synthesis before the 
naturally occurring termination. As a consequence the mutant would be shorter as 
compared to the wild type protein, and therefore the protein is inactive. According 
to the sequence analysis the mutation (the stop codon) occurred in the kinase 

30 domain in a region which is functionally active and highly conserved. With the help 
of the RT-PCR method and specific primers (see Fig. 9.) NORK specific cDNA 
fragments could be amplified. According to this result NORK specific receptor 
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kinase mRNA was synthesized consequently the NORK gene was transcribed in 
both wild type and mutant plants. 

It is obvious for experts in molecular biology that the mutation in the NORK 
gene (the nn1 mutation which generated an in frame stop codon) leads to loss of 
function because the active site needed for the kinase activity was missing (see 
Fig. 13). It is not excluded, however, that more mutation in the MN-1008 plant 
occurred (like in the ABC transporte gene, ODF gene, MADS box gene, and lectine 
gene), and the mutation leads to Nod" phenotype as well, consequently, these 
genes are protected by this invention too. 

Abbreviations used in this inventions: 



15 



20 



25 



30 



NORK: 
nod genes: 



Nod factor: 



Nod": 
Nod+: 
Diploid: 
Teraploid: 

RFLP marker: 



SHMT: 
BAC: 

BAC vector: 



NOD region specific Receptor Kinase gene. 

Genes found in Rhizobiurh bacteria and which are 

responsible for the synthesis and excretion of the Nod 

factor. 

The lipo-chitooligosaccharide produced by Rhizobium 
bacteria which induces nodule formation on plant roots 
(see Fig. 1 .). 

Phenotype of those plants lacking nodules 
Phenotype of those plants with nodules 
Somatic cells of the plants with double chromosome set 
Somatic cells of the plants with quadruplex 
chromosome set 

Restriction Fragment Length Polymorphism, which is 
used as genetic marker. The lenght polymorphis is 
detected in most cases by DNA-DNA huybridization. 
Serine-hydoxymethyl-transferase 
Bacterial Artificial Chromosome 

Low copy number cloning vector which can accomodate 
large (more than 1 00 kb) DNA fragments. 
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BAC clone: 

BAC library: 

bp: 

kb: 

allele: 

primer: 

contig: 

in frame: 

RT-PCR: 

NAB: 

NBW:. 

RAPD: 



Insert cloned into BAC vector, or both the vector and 
the insert together. 

BAC clones representing the entire DNA content of an 

organism. 

base pairs. 

kilobase pairs. 

gene variant. 

short (5-50 bp long) single stranded oligonucleotide. 
DNA region covered by overlapping clones 
according to the open reading frame 
Reverse transcription PCR, that is amplification product 
of transcribed mRNA. 

one of the F2 family segregating the Nod" character 
the other F2 family segregating the Nod" character 
Random Amplified Polymorphic DNA (46) 



BRIEF DESCRIPTION OF DRAWINGS 

Fig.1. Structure of the signal molecules (Nod factors) produced by 
20 Rhizobium bacteria. Nod factors produced by: A. Sinorhizobium meliloti; B. 
Rhizobium leguminosarum bv. viciae; C. Bradyrhizobium japonicum; D. 
Azorhizobium caulinodans; E. Rhizobium tropici; F. Rhizobium sp. NGR234. 

Fig. 2. F2 alfalfa populations segregating the Nod" phenotype. The w?7/NN1 
alleles are highlighted. 
25 A. Crossing of the parents; highlighting linkage group 5 with the nn1 es NNi 

alleles 

B. The nn1 and MV? alleles inherited by the F1 NAB plant. 

C. Segregating nn1/NN 1 alleles in the F2 population originating from self 
mating of the F1 NAB plant. 

30 D. Generating F3 homozygous individulas from the self mating of the 

appropriate F2 individuals. 
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Fig.3. Genetic mapping of the nn1 gene in the tetraploid and diploid 
populations of Medicago sativa. Markers linked to the Nod" phenotype were 
screened by the Bulked Segregant Analysis with the help of the Nod" F2 NAB 
plants and Operon Primers. The position of the selected RAPD markers were then 
5 determined on the diploid alfalfa map. Finally RFLP markers linked to the RAPD 
markers were picked and mapped in the tetraploid population in relation to the nn1 
gene. 

Fig. 4. Functional and restriction map of vector pBeloBacH . The cloning site 
(H = Hind\\\ ) sits in the /acZ gene. B = BarrMY, CM = chloramphenicol, see ref 
10 (52). 

Fig.5. BAC clones in the Nod contig. Primary clones isolated by the help of 
the Q5E and SHMT markers are highlighted by green. Blue colors represent 
secondary clones isolated with the help of the end sequences of the primary 
clones. „b" and „j" represent the left and right side of the inserts, respectively. 

15 Fig. 6. Determination of the precise position of the nn1 gene with the help of 

two close recombination events. The precise localization of the nn1 gene was 
established with the help of two DNA originating from individuals carrying 
recombination events in the Nod region, as well as with the help of two DNAs 
originating from opposite homozygous individuals. Sub-clone G34P44 originating 

20 from BAC 67A11 was used as hybridization probe for filter containing EcoRV 
digested DNA from NAB4156 (recombinant), NAB814 (homozygous), NAB4443 
(recombinant), and NAB2161 (homozygous) individuals. Nod + allele (highlighted 
by + arrow) could be detected for the two Nod + plants (NAB4156 and NAB 814), 
while Nod" allele for the two Nod* plants (NAB4443 and NAB2161). Sub-clone 

25 G33P135 has been used to identify a Nod" allele for NAB4156. Sub-clone G3P126 
identified Nod" allele for plant NAB4443 after Dral digestion. Sub-clone G18P4 on 
the other hand detected a Nod + allele for this plant after Hindlll digestion. Since the 
two plants displaying recombination were Nod" the nn1 gene can be limited 
between these two recombination points. 

30 Fig.7. Genes and there orientation found in the Nod region. The arrows 

show the orientation of the genes in the Nod region. The lengths of the arrows are 
out of scale. pNORK_Nhe5 is a pUC19 based recombinant plasmid carrying the 
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entire NORK gene on a 8.5 kb long Nhel fragment. The name of the genes are 
given after the abbreviations. The accession number of that gene is given which 
showed the highes BLST score in the NCBI databank. Finally the e value is given, 
which is a generally accepted value representing the level of homology. 
5 Fig. 8. Nucleotide sequence of the 8563 bp long DNA region containing the 

entire NORK gene. The transcription start site (ATG) is at position 1153 bp (left to 
right) form the Nhel site (GCTAGC). The stop codon is at position 663 bp from 
(right to left) the last nucleotide of the second Nhel site: There are 15 exons and 
14 introns in the gene. The position of the exons :and introns are highlighted in 
10 Table 6. (There are 60 nucleotides in one row). 

Fig. 9. List of the oligonucleotide primers used for the amplification of both 
genomic and cDNA sequences. 

A. Name, sequence, length, and start points of the primers. 

B. Schematic representation of the NORK gene with the position of the 
15 primers. 

Fig. 10. cDNA sequence of the NORK gene from Medicago truncatula A17 
(There are 60 nucleotides in one row). 

Fig.11. Alignment of the genomic (see Fig 8.) and cDNA (see Fig. 9.) 

20 sequence of the NORK gene from Medicago truncatula A17. The genomic 

sequence (BAC A17) of the NORK gene identified in Medicago truncatula 
A17 was aligned to the cDNA sequence (cDNS A17). For the genomic 
sequences corresponding to the BAC clones: intron sequences are 
highlighted by red, the rest are black (Promoter region, exons, 5' and 3' 

25 untranslated regions). cDNA sequences are highlighted by blue. Nucleotide 

sequences corresponding to the first two (GT) and last two (AG) nucleotides 
in the introns are double underlined. Start codon (ATG) is green, stop codon 
(TAG) is purple. Nhel sites at the beginning and at the end of the sequences 
are highlighted by orange. 

30 

Fig. 12. Deduced amino acid sequence from the NORK cDNA of M. 
truncatula A1 7 plant. (* - stop codon). 
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Fig.13. Functional representation of the NORK protein by highlighting the 
characteristic domains. 

A. The NORK protein is 925 aa long. The different structural/functional 
5 domains are highlighted by different colors. 

B. More detailed description of the functional domains found in the NORK 
protein. 

Fig.14. cDNA sequence of the NORK gene for the two Nod" alleles 1 and 2, 
10 in plants (NAB1 241/6 and NAB701/28) of Medicago sativa 

Fig. 15. cDNA sequence of the NORK gene for one of the Nod + alleles 
(allele 6) in plant (NAB61 5/28) of Medicago sativa. 

15 Fig. 16. Deduced amino acid sequence of the cDNA for the NORK gene 

from Nod" plants (NAB1 241/6 es NAB701/28) of M. sativa. (* = stop codon). 

Fig.17. Deduced amino acid sequence of the cDNA for the NORK gene 
from Nod + plants (NAB61 5/28) of M. sativa. (* = stop codon). 

20 

Fig. 18. Alignment of the cDNA sequences of the NORK genes originating 
from Medicago truncatula, Medicago sativa, Vicia viilosa and Pisum sativum. 
Medicago alleles are from Medicago truncatula A17 (MtA17), M. sativa Nod" allele 
(Ms1 N") and Nod + allele (Ms6N+), as well as from Vicia viilosa (Vv) and Pisum 

25 sativum (Ps). M. sativa Nod" alleles 1 and 2 were identical. Coding regions are 
grouped in three nucleotide groups according to the genetic code. The position of 
the introns are highlighted by light purple (i1-i14). Nucleotides are highlighted by 
red if they are different from the MtA17 sequence. Mutations creating stop codons 
are shown by arrows. At least two alles are in V. viilosa. Ambiguous nucleotied are 

30 higlighted as: M=A or C; R=A or G; W=A or T; S=C or G; Y=C or T and K=G or T. 
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Fig. 19. Alignment of the deduced amino acid sequence of the NORK gene 
from M. truncatula A17, M. sativa allele 6 and allele 1 as well as from V/c/a villosa 
and Pisum sativum. Medicago alleles are from Medicago truncatula A17 (MtA17), 
M. sativa Nod" allele (Ms1N ) and Nod + allele (Ms6N+), as well as from V7c/a villosa 
5 (Vv) and Pisum sativum (Ps). M. sativa Nod" alleles 1 and 2 were identical. Amino 
acid differed form the MtA17 sequence are highlighted by red. At least two alles 
are in V. villosa. The corresponding amino acids are separated by /. Stop codons 
are highlighted by red. (There are 60 aa in one row). 

10 Fig. 20. RFLP pattern of the NORK gene for the individuals from the diploid 

mapping population to demonstrate the single copy nature of the NORK gene. 
Hybridization pattern of the of the parents (M. sativa ssp. coerulea w2 es h/l. sativa 
ssp. quasifalcata k93), the F1/1 and individuals of the F2 generation on a filter 
containing Dral digested total DNA and hybridized by NORK specific probe (see 

15 Example 10.). The DNA used for probe containes two Dral sites, which are 
probably present in the NORK gene of the diploid species. As a consequence 
there are at least three hybridizing bands. (Other Dral sites are not excluded in the 
intron sequences of the genomic DNA). The 212 bp long Dral fragment probably 
did not transfer to the filter. The 10.5 kb fragments were not polymorphic, as 

20 contrast to the small one which is polymorphic. According to the segregation of this 
fragment there is only one copy of the NORK gene in the diploid alfalfa. 
Mcw2=JWed/cago sativa ssp.coeru/ea w2; MqK93=Med/cago sativa ssp. 
quasifalcata k93 

25 Fig. 21. RFLP hybridization of the NORK gene from tetraploid alfalfa. 

Hybridization pattern of the Nod7Nod + M. sativa individulas from the segregating 
population. Total DNA was digested with Hindlll, the probe contained only the 
extracellular part of the cDNA. On the autoradiogram, there is only one strong 
hybridization band for four individuals, and there are two bands for one individual. 

30 Plant NAB814 carries only alleles 5, and 6, respectively. The rest of the plants 
carry only allele 1 and 2 between the recombination sites (see Example 7.). Allele 
5 and 6 can be distinguished according to the size of the two hybridization 
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fragment in NAB814. On the other hand, allele 1 and 2 are indistinguishable (only 
one hybridizing band). This hybridization pattern indicates that there is only one 
copy of the NORK gene in the tetraploid alfalfa as well. 

5 Fig.22. Hybridization pattern of the M. truncatula total DNA to the NORK 

gene. Hybridization pattern of M. truncatula total DNA digested by EcoRI and 
EcoRV enzymes. The probe ("NORK specific") contained only the extracellular part 
of the cDNA. Hybridization bands represent sequences homologous to the NORK 
gene. 

10 

Fig.23. Hybridization pattern of total DNA from different legumes (Sesbania, 
Cassia, Trifolium es Desmodium) to the NORK gene. The hybridization pattern is 
shown on filters containing total DNA digested by EcoRI, EcoRV, Oral and HindlU 
enzymes. The probe ("NORK specific") contained only the extracellular part of the 
15 cDNA (see Example 10.). Hybridization bands represent sequences homologous 
to the NORK gene. S. r. = Sesbania rostrata; C. e. = Cassia emerginata; T. p. = 
Trifolium pratense; D. sp. = Desmodium species. 

Fig 24. Hybridization pattern of total DNA from different legumes (Vicia, 
20 Trifolium es Melilotus) to the NORK gene. The hybridization pattern is shown on 
filters containing total DNA digested by EcoRI, EcoRV, Oral and HindWl enzymes. 
The probe ("NORK specific") contained only the extracellular part of the cDNA (see 
Example 10.). Hybridization bands represent sequences homologous to the NORK 
gene. V. s. = Vicia sativa; M. a. = Melilotus alba; T. p. = Trifolium pratense; T. i. = 
25 Trifolium incarnatum. 

Fig 25. Hybridization pattern of total DNA from different legumes (Vigna es 
Macroptilium) to the NORK gene. The hybridization pattern is shown on filters 
containing total DNA digested by EcoRI, EcoRV, Dral and HindM enzymes. The 
30 probe ("NORK specific") contained only the extracellular part of the cDNA (see 
Example 10.). Hybridization bands represent sequences homologous to the NORK 
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gene. V. u. = Vigna unguiculata; M. a. = Macroptilium atropurpureunr, V. r. = Vigna 
radiata 

Fig 26. Hybridization pattern of total DNA from different legumes (Pisum, 
5 Glycine, Lotus) to the NORK gene. The hybridization pattern is shown on filters 
containing total DNA digested by EcoRI enzyme. The probe ("NORK specific") 
contained only the extracellular part of the cDNA (see Example 10.). Hybridization 
bands represent sequences homologous to the NORK gene. 

10 Fig 27. Hybridization pattern of total DNA from different non-legumes (rice, 

tobacco, wheat and corn) to the NORK gene. The hybridization: pattern is shown 
on filters containing total DNA digested by EcoRI enzyme. The probe ("NORK 
specific") contained only the extracellular part of the cDNA (see Example 10.). 
Hybridization bands represent sequences homologous to the NORK gene. 

15 

Fig.28. cDNA sequence of the NORK gene from Pisum sativum cv. Frisson 
plant. (There are 60 nucleotides in one row). 

Fig. 29. cDNA sequence of the NORK gene from V7c/a villosa S-1 plant. 
20 (There are 60 nucleotides in one row). The nucleotides represented by the two 
allele of V. villosa are highlighted as follows: M = A or C, R = A or G, W = A ro T , 
S = C vagy G, Y = C or T, K = G or T . 

Fig.30. Deduced amino acid sequence of the cDNA for the NORK gene 
25 from P. sativum cv. Frisson plant. Stop codon (*) is highlighted by red. (There are 
60 aa in one row). 

Fig. 31. Deduced amino acid sequence of the cDNA for the NORK gene 
from V. villosa S-1 plant. At least two alles are in V. villosa. The corresponding 
30 amino acids are separated by /. Stop codon (*) is highlighted by red. (There are 60 
aa in one row). 
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Fig. 32. Antigen-antibody reaction using diluted (1:1000) rabbit serum with 
cell protein extracts from 10 mg plant tissue. A. control tobacco plant (non- 
transformed). B. Transgenic plant carrying the MtA17 NORK gene. C. Nod + control 
alfalfa NAB814. D. Nod - alfafa (MN-1008). 

5 

Fig. 33. Multiple alignment of the NSL proteins. Amino acid residues 
were coloured according to an 80 % consensus. Letters mark the following 
amino acid classes: h = hydrophobic (ACFGHIKLMRTVWZ) and I = their 
aliphatic subset (ILV) with turquoise shading; a = aromatic (FHWY) with 

10 green shading; + = positive (HKR) with red shading; p = polar 
(GNEHKNQRST) in red; t = "turn-like" (ACDEGHKNQRST) in green; s = 
small (ACDGNPSTV) in blue. White letters with dark violet background mark 
amino acids identical (conserved 100%) in all sequences. Amino acids 
present in 80% of the sequences are shown by white letter with pink 

15 background. Stars indicate the positions where the amino acids of the 
legume NORKE domain differ significantly from the consensus of other NSL 
sequences. Mt = Medicago truncatula, At = Arabidopsis thaliana, Pv = 
Phaseolus vulgaris, Os = Oryza sativa, Gm = Glycine max, Gos = 
Gossypum, Zm = Zea mays, TC = tentative consensus sequences 

20 constructed from EST sequences by TIGR ( www.tiqr.org) . 

Table 1 . Germination ability, viability and nodulation properties of the F2 
seeds for the NAB and NBW families. F2 seeds were germinated on 1 % water- 
agar, after germination seedlings were transferred to tubes containing slant agar 
25 (Gibson). After one week plants were inoculated with S. rhizobium 41. After 6 
weeks and 2 months nodulation was evaluated visually. For detailed description 
see Example 1 . 

Table 2. Genotyping the F2 NAB and NBW individuals for markers OPW8a, 
30 OPE8c, OPB13b, OPA6a and for nodulation. Total DNA from the highlighted NAB 
(15 Nod" ; 15 Nod+) and NBW (8 Nod"; 22 Nod+) individuals were amplified by 
OPW8, OPE8, OPB13, OPA6 Operon Primers as described in Example 2. During 
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the amplification reaction fragments (RAPD fragments) corresponding to markers 
OPW8a, OPE8c, OPB13b and OPA6a either appear (coded by number 5) or not 
(coded by number 3). Genotypes of the Nod* and Nod" individuals are coded by 
number 5 or 3, respectively. Genotypes in the table are highlighted according to 
5 the concept of colormapping, colors without number are predicted genotypes (see 
ref. 39). 

Table 3. Genotyping the OPW8a, OPE8c, OPB13b, OPA6a markers on the 
diploid mapping population. Genotyping of the maternal and parternal alleles 

10 corresponding to the hybridizing bands were done as follows: maternal 
homozygous = 1 (yellow); paternal homozygous = 3 (purple); hetrozygous = 2 
(green); maternal dominant = 5 (light green); paternal dominant = 4 (blue);. missing 
genotypes - 0 (grey). Colormapping was accordng to Table 3, prediction according 
to ref. 39. Colormaps show the U584B-Lblll region of linkage group 5 of the diploid 

15 M. sativa ssp. coerulea and M. sativa ssp. quasifalcata genetic map (40). 3A and 
3B highlights the non-predicted and the predicted map, respectively. 

Table 4. Genetic mapping of markers U71, U224, CG13, SHMT, Q5E es 
U492 on the tetraploid alfalfa population. Genotypes of the individuals plants were 

20 according to the size of the hybridizing band. If the band was similar to the size of 
the band from the Nod" parent, the genotype code was 3 (purple). If the band was 
similar to the size of the band from the Nod + parent, the genotype code was 5 
(light green). Genotypes of the Nod + and Nod" individulas are coded by number 5 
or 3, respectively. Genotypes in the table are highlighted according to the concept 

25 of colormapping, colors without number are predicted genotypes (see ref. 39). 

Table 5. List of the sub-clones originating from BAC clones 28112, 50E23, 
2D11 es 67A11. The size of the insert in the sub-clones are also shown. For 
detailed description see Example 6. kb = kilobase pairs. 

30 

Table 6. Position of the exons and introns in the NORK gene according to 
Fig.9. Numbering starts from the first nucleotid of the Nhel site (see Fig. 8.). 
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Table 7. Cloning of the NORK gene into plant cloning vectors. The 
description of the vectors used for plant trasnformation and the details of the 
cloning is given in Example 12. 

5 

Table 8. List of the Agrobacterium strains carrying the NORK gene. The 
construction of the Agrobacterium rhizogenes and A. tumefaciens strains used for 
the two different transformation protocols are given in Examples 12A and 12B, 
respectively. 

10 

Table 9. List of the A. rhizogenes strains used for transformation and the 
transformation frequency. Nod" plants were infected by A. rhizogenes strains as 
described in Example 12B. After emergence of the roots GUS staining was 
performed and the nodulation ability was tested. The second and the third row in 
15 the Table highlights the results obtained for the two transformation methods 
(combined). In brackets, the results from the second experiment are shown. 

Table 10. Summary of the transformation experiments with Medicago sativa 
and Agrobacterium tumefaciens. 

20 Table 11. Summary of the transformation experiments with Nicotiana 

tabcum, Vicia villosa and Agrobacterium tumefaciens. 

BEST MODE OF CARRYING OUT THE INVENTION 

25 Hereinafter the invention is demonstrated by examples. These examples do 

not restrict the protected domain of the invention they are only for demonstration 
purposes. 

Example 1 

30 

Genetic crosses and the analysis of F2 progeny of the NAB and NBW 
families 
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The tetraploid (2n=4x=32) Medicago sativa MN-1008 mutant line (28) has 
been provided us by Prof. D. Barnes (University of Minnesota, MN, USA) 
(primarily called MnNC-1008 (NN), but generally abbreviated as MN-1008), is 
5 unable to form symbiotic nodules in the presence of compatible bacteria. The 
mutant plant emerged spontaneously by means of crosses between cultivated 
alfalfa populations and subsequent self-matings (27). One individual (referred in 
laboratory notebook as MN-1008/17) of the purple flowered MN-1008 mutant 
alfalfa line has been used as maternal parent in genetic crosses. The paternal 

10 parent was one individual of the M. sativa cv. Nagyszenasi (referred as MsNa/5) 
alfalfa population. Since the self-pollinating efficiency of the MN-1008/17 maternal 
parent was between 10-20 %, its flowers had to be sterilized (emasculated). Prior 
to crosses the so-called banner petals of the matured flowers of the maternal 
parents were trimmed by scissors and followed by splitting open of the flowers by 

15 squeezing them to avoid the anthers to hit the stigma. Then the flowers were 
immersed in 51 % ethanol for 3-4 sec and then washed in water. After the flowers 
dried the stigma was covered with the pollens from the male parent. The efficiency 
of the ethanol treatment was checked in control experiments: the self-pollinating 
efficiency of the treated flowers in the absence of foreign pollen was about 0.5 %. 

20 The F1 progeny seeds coming from the above described cross were 

vernalized at 5-8 °C for two weeks, then germinated and planted out first into 
plastic pots with diameter 8 cm, and after the plants gained strength into pots with 
diameter 25 cm containing flower-soil termed NT. Plants are maintained to bloom 
in greenhouse at 22-25 °C with photoperiod of 16 hours lighting and 8 hours dark. 

25 The flowers were self-pollinated one by one manually by squeezing the base of 
the flowers (tripping). At this time the stamen knocked against the stigma and the 
pollens got into the stigma from the anthers. After self-pollination the F2 ovules 
appeared on the plants from which the matured seeds developed. The F2 
generation were begun to nurse in symbiotic test to determine their symbiotic 

30 phenotype. The matured seeds were vernalized and their surface were sterilized 
by rinsing them in 70% ethanol for 30 sec and treating with 0.1% HgCI 2 solution 
for 5 min followed with rinsing in sterile distilled water for five times. Then the 
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seeds were swelled in sterile water for 8 hours. After swelling, the seeds were 
placed in equable distances from each other on the surface of plates in Petri 
dishes containing 1% agar in distilled water. The dishes were incubated upside 
down in dark to prevent the emerging roots to grow into the agar medium. Next 
5 day the seedlings with roots ofabout 0.5 - 1 .0 cm length were then placed on the 
surface of 1% agar slants with nitrogen-free Gibson medium (41) and the plants 
were grown at 22 - 25 °C with 16-h photoperiod (10 000 Lux). The seedlings were 
inoculated with Sinorhizobium meliloti 41 bacteria culture suspended in 1ml 
Gibson medium after 5-7 days of transfer. 

10 The S. meliloti 41 bacteria culture were prepared for inoculate as follows. 

An inoculation loop of S. meliloti 41 strain (42, 43) stored at -80 °C were 
inoculated into test tube containing 5 ml YTB medium (44). The bacteria culture 
was rotated in test tube for 24 hours. The grown-up culture were inoculated into a 
500 ml flask containing 100 ml YTB medium and the culture were grown up to 

15 stationer phase for about 24 hours. The concentration of the bacteria suspension 
grown-up to stationer phase was between 1-3 x 10 9 cells/ml. The bacteria were 
collected by centrifugation (in centrifuge Sorvall RC5 with GSA type angle rotor at 
4 °C with 10 krpm for 10 min) under sterile conditions. The supernatant was 
discarded and the cells were resuspended in 100 ml Gibson medium (41). The 

20 concentrated bacteria culture was diluted in 100-fold in Gibson medium before 
inoculation. After dilution the concentration of the bacteria was 1-3 x 10 7 cells/ml. 
The nodulation phenotype of the plants was scored 6 weeks after inoculation, 
afterwards plants were potted into soil. Plants were removed gently out of the pots 
and were evaluated once more for the presence or absence of nodules 2 months 

25 later (secondary control of nodulation). The plants were planted back into pots and 
grown further. The flowers of the F2 individuals selected later by genetic analysis 
(determination of the genotypes for different molecular markers) were self- 
pollinated as described above to generate F3 seeds which were tested for 
symbiotic nodulation. 

30 After the crosses 62 F1 seedlings were planted and grown up. Forty-four of 

these plants produced sufficient flowers and F2 seeds (at least 50-100 
seeds/individuals) after self-pollination to test their viability and analyze the 
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segregation of symbiotic nodulation phenotype. Two F1 plants (NAB and NBW) 
producing the most vigorous F2 progeny in which the Nod" phenotype could be 
evaluated were selected for further experiments. The germinating, viability and 
nodule forming characters of NAB and NBW F2 seeds are listed in Table 1 . The 
5 segregation ratio of the Nod" and Nod + characters was between 1 :50 and 1 :55 that 
is comparable to the segregation ratio of a single, recessive character in a 
tetraploid F2 segregation population (theoretical value is 1:35). 

Composition of plates containing 1 % agar in distilled water: 
10 10 g Bacto agar 

1 liter distilled water 
Sterilization for 30 min at 121 °C 

Composition of flower-soil termed NT: 
15 1 part Fiorina flower-soil (commercially available) 

1 part peat (commercially available) 
1 part sand from river Maros (commercially available) 

Composition of YTB medium: 
20 1% Bacto Tryptone 

0.1% Yeast extract 
0.5% NaCI 
1 mM MgS0 4 
1 mM CaCI 2 

25 

Example 2 

Identification of linked RAPD markers to the nn i gene of the Nod" MN-1008 
mutant plant using the NAB and NBW F2 families 

30 

The identification of tightly linked markers to the nni gene, that is to the Nod" 
character was performed with the so-called Bulked Segregant Analysis method 
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(45). To perform the screen five Nod" and five Nod + plants were selected from the 
NAB and NBW families, respectively (Nod" NBW plants: NBW6, NBW20, NBW37, 
NBW162, NBW790; Nod + NBW plants: NBW9, NBW23, NBW74, NBW440, 
NBW718; Nod" NAB plants: NAB65, NAB267, NAB637, NAB701, NAB908; Nod + 
5 NAB plants: NAB26, NAB82, NAB139, NAB421, NAB814). The analysis of two F2 
families enhanced the possibility to identify more linked markers. Since alfalfa is 
an outcrossing plant, the genome is heterozygous carrying different alleles for the 
genes. Analysing the inheritance of the alleles in a tetraploid population (as shown 
in Figure 2.) demonstrated that NAB F1 individual contained two Nod" alleles 

10 (indicated as 1 and 2) and two Nod + alleles (indicated as 5 and 6), while it did not 
carry the two other Nod + alleles (indicated as 7 and 8). As it turned out later, the 
analysis of the NBW family allowed us to detect polymorphism of these Nod + 
alleles (indicated as 7 and 8) and to identify additional linked molecular markers. 
Total DNA was isolated from young leaves of the Nod" and Nod + plants 

15 using the QIAGEN Plant DNA Isolation Kit (DNeasy Plant Mini Kit; QIAGEN Inc. - 
USA; 28159 Avenue Stanford; Valencia CA 91355) according to the supplier's 
instructions. The concentration of the DNA samples were determined by 
spectrophotometer (OD 2 6o= optical density measured at 260 nm) (1 OD 2 6o = 50 
//g/ml DNS). The DNA samples were diluted to a concentration of 5 ng/jA, and the 

20 DNA samples of five Nod" and Nod + plants were mixed, to produce the Nod" and 
Nod + homozygous groups. The mixed DNA samples of the homozygous groups 
were used as templates in polymerase chain reactions (PCR): 12.8 [A sterile 
distilled water, 5 //I DNS, 2.5 [i\ 25 mM MgCI 2 , 2.5 ji/l 10x Taq polymerase buffer 
(Zenon Kft, H-6720. Szeged, Berzsenyi u. 3.), 2 //I Operon Primer (2.5 pmol/fjJ), 

25 and 0.2 jj\ Taq DNS polymerase (5 U/yc/l) (Zenon) were pipetted into PCR tubes of 
500 fj\ volume. The PCRs were carried out in 40 cycles with the following steps: 5 
sec at 94 °C, 1 min at 37 °C and 1 min at 72 °C. After completion of the PCR 
amplification, 5 //I AGE (AGE = 20 % sucrose, 0.1 % bromophenol blue, 0.1 % 
xylene cyanol) stain was added to the samples and then the amplified products 

30 were separated in 2 % agarose gel (Sigma Low EEO) of 1 5 x 20 x 0.4 cm with 2 
V/cm voltage for three hours in 1x TEA (40 mM Tris, pH 7.8, 50 mM EDTA, 50 mM 
acetic acid) buffer. After separation of the fragments, the gel were stained in 0.1 % 
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ethidium bromide solution and exposed to UV light to take pictures. The first 520 
10-mer primers of Operon Primer set (Operon Technologies, Alameda, Ca, USA) 
with different nucleotide composition were tested to amplify of the DNA of the two 
homozygous bulks for both the NAB and NBW families. The PCR amplifications 
5 with these short primers resulted in fragment pattern typical for the given primer. 
The detected differences in the pattern (polymorphism) could be used as RAPD 
molecular markers (46). As a result of the tests of the primers of the Operon 
Primer set, several amplified fragments could be identified that were amplified on 
the bulk of the Nod + DNA but not on the bulk of the Nod" DNA. Four out of them 

10 were analysed thoroughly: OPA6a (identified in NBW family), OPW8a (identified in 
NAB family), OPE8c es OPB13b (identified in both families). The sizes of the 
amplified products identified in the bulks of the Nod + plants were as follows: 
OPA6a, about 650 bp; OPW8a, about 300 bp; OPB1 3b, about 850 bp; OPE8c, 
about 480 bp. Afterwards the the individuals DNAs were re-tested in PCR to 

15 identify the genotypes. In addition other individuals from the families (15 NAB and 
8 NBW Nod" plants, as well as 15 NAB and 22 NBW Nod + plants) were tested for 
the same four RAPD markers. The result of the genotyping (thegenotypes) was 
displayed in color-map (39) in Table 2. The results presented in the table 
demonstrated unambiguously the linkage between the nni gene and the four 

20 RAPD markers and determined the genetic order of the markers. 

For fine mapping, the NAB family has been selected since it was the largest 
F2 population. The F2 individuals with diallelic configuration have been selected 
from this family to produce F3 individuals by self-mating (see Figure 2.). Individuals 
with homozygous genotypes for the different alleles were searched by 

25 determination of their genotypes. Three F3 plants were selected: the plant 
designated as NAB1 241/6 that is Nod" and contains the alleles labelled with no. 1 
in homozygous configuration, the Nod" plant designated as NAB701/28 containing 
the alleles labelled with no. 2 in homozygous configuration around the genomic 
region of the mutation and the Nod + plant designated as NAB615/28 carrying the 

30 alleles labelled with no. 6 in homozygous configuration. No individuals could be 
detected containing homozygous alleles labelled with no. 5 in this region. 
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Example 3 

Genetic mapping of QPW8a, OPE8c. QPB13b es az QPA6a RAPD markers 
on the linkage map of diploid alfalfa (M. sativa) 

5 

In order to determine the map position of the nn^ gene on the linkage map 
of alfalfa, the closely linked RAPD markers (markers OPW8a, OPE8c, OPB13b 
and OPA6a) were mapped on the genetic map of diploid alfalfa (40). To execute 
this experiment, the genotypes of the individuals of the diploid segregation 

10 population had to be determined. DNA fragments were amplified as described 
above using the OPW8, OPE8, OPB13 and OPA6 primers, respectively and the 
total DNA of NAB814 and,NBW9 plants as template DNA: After separation, the 
DNA fragments corresponding to markers OPW8a, OPE8c, OPB13b and OPA6a 
were isolated from the agarose gels, purified by the QIAGEN Fragment Isolation 

15 Kit (QIAEX II Gel Extraction Kit; Cat.No. 200 51 QIAGEN Inc) and their 
concentration determined by spectrophotometer (determination of OD at 260 nm). 
Fifty ng of the fragments were labelled with deoxycitidine-5'-[alfa- 32 P] triphosphate 
(Code: FP-205, Izotop Intezet Kft, 1121 Budapest, Konkoly Thege Miklos ut 29- 
33.) using the Ready-To-Go DNA labelling (-dCTP) Kit (cat. number: 27-9240-01; 

20 Amersham Pharmacia Biotech Inc, Piscataway, NJ, USA) according to the 
suppliers' instructions. The HybondN* nylon filters (cat. number: RPN3003B, 
Amersham Pharmacia Biotech Inc, Little Chalfont, Buckinghamshire, England) 
carrying the restriction enzyme digested DNA of the individuals of the diploid 
mapping population were hybridized with the hybridization probes prepared as 

25 described above. The filters were prepared by the following method: total DNA was 
isolated of the individuals of the F2 mapping population (47) as described in 
Example 2. Aliquots of 20 //g DNA were digested with Dral, EcoRI, EcoRV and 
H//7dlll restriction enzymes (Fermentas AB, Vilnius, Lithuania), respectively and the 
restriction fragments were separated in 0,8 %-os agarose gels as described in 

30 Example 2. The fragments were transferred onto HybondN + nylon membranes 
according to the suppliers' instructions and the filters were hybridized at 60 °C for 
16 hours in CG buffer published by Church and Gilbert (48). After hybridization, 



SUBSTITUTE SHEET (RULE 26) 



WO 02/102841 



30 



PCT/HU02/00055 



washing was carried out three times for 20 minutes each at the temperature of the 
hybridization in the following solutions: 1.) 0.1% SDS and IxSSPE (49); 2.) 0.1% 
SDS and 0.5xSSPE; 3.) 0.1% SDS and O.lxSSPE. Autoradiography of the filters 
was at -80 °C using the Sterling Diagnostic (Sterling Diagnostic Imaging Inc., 
5 Newark, DE 19714 USA) X-ray films or the filters were exposed in the exposure 
casette of the phosphorimager scanning instrument (Phosphorimager™ 445 SI; 
Molecular Dynamics Inc., Sunnyvale, CA, USA) and after the scanning of the 
screens the results were analysed by the ImageQuant computer software. The 
hybridizing fragments corresponding to the alleles of the OPW8a, OPE8c, OPB13b 

10 and OPA6a markers were identified as RFLP bands and the genotypes of the 
individuals were determined. The map position of the markers were determined by 
the colormapping procedure (39): all of the four markers were mapped in linkage 
group 5 (LG5) close to the U224 marker. The genotypes of the OPW8a, OPE8c, 
OPB13b and OPA6a markers of the individuals in the diploid mapping population 

15 and the results of the genetic mapping are presented in Table 3. 
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Example 4 

Genetic mapping of the RFLP markers between U584B and Lblll markers in 
the tetraploid population segregating the nn i Qene 

5 

To ensure that the map position identified in the diploid population with 
molecular markers linked in fact to the nm gene (to the Nod" pnenotype) the map 
position of some markers mapped previously in this region (between the U584B 
and Lblll markers) were determined in the tetraploid alfalfa genetic map. The 

10 segregation of five RFLP markers (U71, U224, CG13, SHMT and U492) and a 
specific PCR marker (marked with Q5E) generated by cloning the amplified 
fragment (preparing the pQ5E palsmid) of the RAPD marker designated as 
OPQ5E were analyzed in the NAB population segregating the nm gene. 

Preparation of pQ5E plasmid: PCR amplification was performed using 50 ng 

15 total DNA of NAB814 plant (see the preparation of plant total DNA in Example 2) 
and OPQ5 primer as described in Example 2. The amplified products were 
separated in agarose gel (see Example 2) and the 1750 bp-long fragment 
corresponding to the Q5E marker were excised, reisolated from agarose gel and 
purified as described in Example 3. After adding 5.5 //I 10x Klenow buffer to the 50 

20 y\ eluated DNA solution, fragments were treated with 0.5 //I (5 unit) Klenow 
fragment of the DNA polymares I according to the suppliers' (Fermentas AB) 
instructions. Then 4 fj\ (1 jjg) H//?cll-digested [prepared according to suppliers' 
(Fermentas AB) instructions], CIP(Calf Intestine Alkaline Phosphatase)-treated 
(dephosphorilation) and purified pUC19 plasmid (49), 7 /j\ 10x ligase buffer and 3 

25 /j\ (15 U) T4 DNA Ligase enzyme (Fermentas AB) were added to the fragments. 
The ligation was performed at 14 °C for 24 hours. After this the ligation was 
transformed into 250 //I E. coli DH5ar kompetent cells (Amersham Pharmacia 
Biotech AB, Uppsala, Sweden) using the CaCI 2 procedure (50). After induction of 
the bacterial gene which resulted in resistance against ampicillin (phenotypical 

30 expression), the bacteria were plated on LB plates containing ampicillin (100 
//g/ml) and incubated at 37 °C for 16 hours. Plasmid DNA was isolated from the 
resistant colonies by QIAGEN plasmid isolation kit (QIAGEN Plasmid Mini Kit; Cat. 
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No. 12123 QIAGEN Inc.). One clone was selected by PCR amplification from the 
clones carrying the 1750 bp-long amplified product: BRC1464 = E. coli DH5a 
(pQ5E). The sequence of the insert was determined and specific primers were 
designed based on the sequence: 
5 Q5EU1 : 5'-CGGCGTCTTGCTAAAGGAGA-3'; 

Q5ED1 : 5-TATTCGACTCATCATGGTTA-3'; 
Q5EU2: 5-GTCAAATATCGATTCGTGAT-3'; 
Q5EU3: 5-CTAAG AAAG GCTTTGGTTG G-3'; 
Q5ED3: 5'-TCACGAATCGATATTTG ACA-3'). 

10 Using the specific primers in PCR amplification, the dominant OPQ5E 

marker was transformed into codominant Q5E marker and its more precise map 
position was determined in relation to the other markers. 

The map position of markers U71, U224, CG13, Q5E es U492 have been 
published earlier (40). The SHMT marker (51) mapped between markers Q5E and 

15 U224 on the genetic map of diploid M. sativa (Kalo et al. manuscript in 
preparation). Markers U71, U224, CG13, SHMT and U492were mapped as RFLP 
markers using the F2 NAB individuals, while the marker Q5E was mapped by 
specific PCR (94°C 30sec, 60 °C 1 min, 72 °C 1 min in 40 cycles) as well as by 
RFLP in the tetraploid population. The inserts of the clones (47, 51) corresponding 

20 to RFLP markers and pQ5E clone were reisolated, labelled, hybridized, washed 
and autoradiographed according to the method described in Example 3. The filters 
were selected for hybridization containing the Oral, EcoRI, EcoRV and Hind Ill- 
digested total DNA of the selected Nod'es Nod + NAB individuals. The genotypes of 
the individuals were determined for markers U71, U224, CG13, SHMT, Q5E and 

25 U492 by the hybridization signals corresponding to the different alleles. The order 
of the genes shown in Table 4 was determined by the colormap procedure. The 
summary of the genetic mapping of the nn 1 gene displaying the steps from the 
identification of the linked markers to the confirmation of the map position in the 
tetraploid population is presented on Figure 3. Analyzing the mapping data, it 

30 turned out that the markers SHMT and Q5E were the most tightly linked markers to 
the nni gene. No recombination could be detected in the individuals of the 
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analyzed population between markers SHMT and nni. One recombination could be 
detected between markers /?/?* and Q5E in the analyzed individuals. 
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Example 5 

Identification of overlapping BAG clones, constructing the contiq 

5 The primary BAC clones (67A11, 2D 11 es 20K10) were isolated by two 

molecular markers (SHMT and Q5E, see Figure 3.) most tightly linked to nr>t 
mutation in the following way: the Q5E fragment was isolated from plasmid pQ5E, 
while the SHMT fragment was excised from the plasmid containing the cDNA of 
the gene with Kpn\ restriction enzyme (51). The fragments were reisolated after gel 

10 electrophoresis, labelled and the filters were hybridized, washed and 
autoradiographed as described in Example 3. The filters contained more than 30 
000 clones of the BAC library constructed from the total DNA of Medicago 
truncatula a close relative of Medicago sativa, (53). The BAC clones in the library 
contained about 100 kb inserts of the total DNA of M. truncatula . Cloning was 

15 done by H/ndlll digestion of the total DNA and cloned into the Hind\\\ restriction 
sites of the pBeloBACII vector (52) (Figure 4.). The filters and clones of the BAC 
library are available in a public collection (Clemson University Genome 
lnstitute,.100 Jordan Hall, Clemson 29634-5727, South Carolina, USA). 

Two primary BAC clones (clones 67A11 and 2D11) was identified by 

20 hybridization with SHMT fragment, and one primary BAC clone (20K10) with the 
Q5E fragment. An inoculation loop of the BAC clones 67A1T, 2D11 and 20K10 
stored in the clone collection at -80 °C were taken from the stock and inoculated 
into 500 ml flask containing 100 ml LB media and chloramphenicol in 15 A/g/ml 
concentration, respectively. The clones were grown to stationer phase at 37 °C in a 

25 shaker (250 rpm). The plasmid DNA of BAC clones were purified by the QIAGEN 
Large Construct Kit according to the suppliers' instructions. The plasmid DNA of 
the three clones were subjected to single EcoRV and double EcoRV/Not\ digestion 
and the fragments were separated in agarose gels. Based on the pattern of the 
single and double digestion, the EcoRV end-fragments of the clones could be 

30 selected. From these fragments the end fragment of the inserts could be detected 
by Hind\\\ digestion. These end-fragments were used for chromosomal walking, 
that is to identify overlapping BAC clones. The H/ndlll end-fragments of the clone 
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67A11 were approximately 3300 es 800 bp, of the clone 2D11 were 3300 and 
4000 bp and of the clone 20K10 were appr. 1080 es 1400 bp length, respectively. 
The six end-fragments were reisolated from agarose gel and repeated 
hybridization was performed to identify and isolate the secondary BAC clones 
5 (50E23, 64B5, 28112, 6A5). Based on the restriction digestion pattern (H/ndlll, 
EcoRl, EcoRV) of the primary and secondary BAC clones, the degree of the 
overlap of the clones could be determined and the position of the clones to each 
other could be established. In this way the Nod contig could be constructed (see 
Figure 5.). 

10 

Example 6 

Subcionina of the Nod contig 

15 The result of the genetic mapping revealed that the SHMT marker could be 

the closest marker to the nni gene therefore the chromosomal region around the 
SHMT was analyzed first thoroughly by delimiting the genomic region containing 
the gene and by determination of the sequence of this region. In parallel 
experiments the subcloning of two BAC clones (67A11, 2D11) identified by SHMT 

20 and the subcloning of clones 50E23 es 28112 overlapping with 2D11 clone were 
needed. The subcloning was performed in three ways. On one hand (A) the ends 
of the BAC inserts in the contig were cloned, on the other hand (B) the 1-2 kb 
length fragments of the clones 67A11, 2D11 and 50E23 generated by sonicating 
were cloned, so random subcloning was performed, and on the third hand (C) 

25 cloning was realized after restriction enzyme digestion. 

Example 6A 

Subcionina of the end-fragments of the BAC inserts 

30 

The end-fragments of the BAC clones were idenified as described in 
Example 5 and the Hind\\\ fragments were cloned into pUC19 plasmid in the 
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following way: 4 /j] (1 jjg), /-//ndlll-digested and dephosphorylated pUC19 plasmid 
(Fermentas AB), 14 fj\ 5x Ligase buffer, and 2 //I (10 unit) T4 DNA Ligase enzyme 
(Fermentas AB) were added to the DNA fragments purified with the QIAGEN DNA 
Isolation Kit (see Example 3) and eluated in 50 //I buffer. The ligation was executed 
5 at 14 °C for 24 hours. The ligation was transformed into 250 //I E. coli DH5or 
competent cells using the CaCI 2 procedure (see Example 4). The transformant 
colonies were checked by the restriction enzyme digestion of the plasmid DNA 
isolated with QIAGEN plasmid DNA Isolation Kit (see Example 3). 

10 Example 6B 

Random subcloninq of the inserts of the 67A11. 50E23 and 2D11 BAC 

clones 

15 20 jug DNA of the clones 67A1 1 , 50E23 and 2D1 1 in 200 jj\ IE buffer (1 mM 

EDTA, 10 mM Tris-HCI, pH7.5), respectively, were sonicated for 10 sec with the 
smallest head of the MSE Ultrasonic Disintegrator equipment. The fragmentated 
DNA was separated in agarose gel and the 1-2 kb length fragments were excised, 
reisolated and purified as described in Example 3. The DNA fragments generated 

20 in this way were ligated into H/ncll-digested (Fermentas AB) and dephosphorilated 
pUC19 plasmid as described in Example 4 and then they were transformed into E. 
coli DH5a competent cells (see Example 6A). Plasmid DNA were isolated from the 
transformant colonies with QIAGEN Plasmid Isolation Kit and their contents for 
inserts were checked by electrophoresis in 1 % agarose gels. 

25 

Example 6B 

Subcloninq of the 67A11. 2D11. 50E23 and 28112 BAC clones after 
restriction enzyme digestion 

30 

The plasmid DNA of BAC clones 67A11, 2D11, 50E23 and 28112 were 
purified as described in Example 5 and 1 //g purified plasmid DNA were digested 
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with Nhe\, Xba\, Hin2\, Hin6\, Tail, Taq\, BamH\, Hindlll, EcoR\ and Sai/IIIA1 
enzymes (Fermentas AB), respectively according to the suppliers' 
recommendation. The Nhel and Xba\ fragments were cloned into Xfcal-digested 
(the Nhe\ and Xbal result in compatible sticky ends) and dephosphorylated pUC19 
5 vector, the Hin2\, HinQl, Tai\, Taq\ fragments were cloned into Accl-digested and 
dephosphorilated pUC19 vector, the Hindlll and EcoRI fragments were cloned into 
Hindlll- and EcoRI-digested and dephosphorilated pUC19 vector and the BamHl 
and Saul 1 1 A1 fragments were cloned into BamHl-digested and dephosphorilated 
pUC19 vector. After digestion the restriction fragments were purified by the 

10 QIAGEN DNA Isolation Kit (see Example 3), ligated into the corresponding 
enzyme-digested (Nhel, Acc\, HindlW, EcoRI and BamHl) and dephosphorilated 
pUC19 plasmid and transformed as described in the Example 6A. The 
transformant colonies were checked by digestion with the corresponding restriction 
enzyme after plasmid DNA isolation as described above (Example 6A). 

15 The recombinant clones and their insert sizes generated with subcloning are 

presented in Table 5. 

Example 7 

20 Determination of the map position of the gene more precisely in the 

contig by identification of the closest recombination sites at both sides of the 
mutation 

The generation of the subclones from the BAC clones overlapping the nni 
25 mutation and the determination of the map position of the nn-i gene more precisely 
by this subclones were done parallelly. The generated subclones were used as 
genetic markers to determine the recombination sites more precisely. After having 
the recombination sites more precisely it was obvious that the detailed sequence 
analysis for BAC 67A11 and 2D11was necessary. 
30 Two Nod" plants (NAB4156 es NAB4443) carrying recombination close to 

the nni mutation and two other plants that did not contain recombination close to 
the Nod region but they were opposite homozygotes (Nod + NAB814 and Nod" 
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NAB2161) for the Nod character were used for genetic experiments. The clones 
G34P44 and G33P155 coming from BAG clone 67A11, the clone G3P126 coming 
from 2D11 and clone G18P4 arising from 28112 BAC clone were converted to 
molecular markers. The inserts of the clones were excised with the proper 
5 restriction enzyme, the fragments were reisolated and used as hybridization 
probes to the filters containing the EcoRV-, H/ndlll- and Dral-digested total DNA of 
above mentioned four plants. From the experimantal results (see Figure 6) it could 
be concluded that the position of the nn? gene was delimited on a genomic region 
(Nod region) by two recombination detected by fragment G34P44 located on BAC 
10 clone 67A1 1 and by fragment G18P4 located on BAC clone 28112. 

Example 8 

Determination and analysis of the sequence of the Nod region 

15 

In order to determine the sequences of the subclones obtained as described 
in Example 6, the ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction 
Kit and the ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit 
(Perkin Elmer Applied Biosystems; 850 Lincoln Centre Drive Foster City, CA94404 

20 USA) was used according to the manufacturer's recommendation. The plasmid 
DNA for templates of the seqeuncing reaction were isolated by QIAGEN Plasmid 
Isolation Kit and their concentration were quantified using spectrofotometry (see 
Example 3.). The sequences of the amplified products labelled by flurorescent 
dyes were determined by ABI 373 and ABI 377 automated sequencer (Perkin 

25 Elmer Applied Biosystems; 850 Lincoln Centre Drive Foster City, CA94404 USA). 
Authenticity of the DNA sequences: the sequences were determined on both 
strand, in one direction the reactions were repeated 2-8 times, based on 
independent templates in ambiguous cases. The sequence data were stored in 
computer and their analyses were started with ordering them into overlapping 

30 sequences. The overlapping end-fragments of the BAC clones and of the 
subclones generated by restriction digestion of these BAC clones could help to 
assemble the sequences and to arrange the subclones generated by random 
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fragmentation or restriction enzyme digestion and the BAC clones to each other. 
By the help of the available sequences, homologue genes were searched in the 
NCBI (National Center for Biotechnology Information 
http ://www. ncbi. nlm . nih .qov/BLASTA and the Arabidopsis 

5 (http://www.arabidopsis.org ) databanks. Taking into consideration of the homology 
between the sequences, the common and general characters of the gene 
structures [consensus sequences as initiation and termination codons, open 
reading frame (ORF), consensus sequences characteristics of exons and introns 
such as GT-AG rule, the point of divergence, etc.] the starting- and end-point of 
10 the coding regions of the genes located in the Nod region were searched and the 
order and the orientation of the genes were determined. The results of this 
sequence analysis are presented schematically on Figure 7. 

Example 9. 

15 

Detailed analysis of the NORK gene 

The Nod" phenotype of the mutant plant can be explained above all 
with a mutation in the NORK (NO D region linked Receptor Kinase) gene coding for 

20 a receptor kinase, that is why the sequence analyis of the wild type NORK gene on 
BAC clone 67A1 1 was carried out first. The NORK gene is located on an Nhel 
fragment of 8563 bp length and its nucleotide sequence - from the Nhel site to the 
next M?el site - is shown in Figure 8. This nucleotide sequence identify the 
genomic sequence of the NORK gene from the M. truncatula A17 plant (the BAC 

25 library was constructed from this plant). To reveal the exact structure of the gene 
we have to know the exon/intron borders/positions which can be predicted from the 
genomic sequence based on the GT-AG rule and the branching point, however, it 
can be determined exactly only by knowing the cDNA sequence. For this purpose 
we isolated RNA from the roots of M. truncatula line A17, then we prepared cDNA 

30 from the mRNA by reverse transcription. cDNA samples were produced by PCR 
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amplification using NORK specific primers(see Figure 9.), then the amplified 
fragments were sequenced either directly or after cloning as desribed in Example 
8. 

5 Example 9 A. 

Construction and sequencing of NORK specific cDNA sequences 

the seeds of M. truncatula A17 were germinated after sterilization and 

10 planted into sterile soil. From the roots of the two-three week old plants the cDNA 
representing the NORK mRNAs as well, was prepared as follows. The roots of the 
plants were pulverized/ground under liquid nitrogen, the powder was solubilized in 
RNA extraction buffer (8 M guanidinium chloride, 20 mM MES, 20mM EDTA, 50 
mM /?-mercaptoethanol), then purified by extraction with a mixture of phenol- 

15 chlorophorme-isoamil-alcohol (25:24:1). After centrifugation the supernatant was 
precipitated at -20 °C for 1 hour with 0.7 volume of 95% ethanol and 0.2 volume of 
1 M acetic acid. After centrifugation the precipitated RNA was dissolved in DEPC 
(diethyl-pyrocarbonate)-treated distilled water. The cDNA was prepared from the 
RNA with the help of the RevertAid H Minus First Strand cDNA Synthesis Kit 

20 (Fermentas AB). The different RNS samples were treated with RNase free DNase 
(Fermentas AB) and then the cDNAs were synthetized with the help of oligo-dT 
primer and reverse transcriptase enzyme (Fermentas AB) in the presence of 
RNase inhibitor (Fermentas AB) at 42 °C. Overlapping fragments of the NORK 
cDNA were amplified from the prepared cDNA with different combinations of the 

25 primers shown in Figure 9. The PCR conditions were as follows: 95 °C for 30 
seconds, 55 °C for 1 minute, 72 °C for 1 minute, these steps were repeated 35- 
times. The specific NORK fragments amplified by RT-PCR from the cDNA were 
separated by agarose gel electrophoresis and purified with the help of the 
Fragment Isolation Kit of QIAGEN. The purified fragments were sequenced as 
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described in Example 8. either directly or after cloning them into Hindi digested 
and phosphatase treated pUC19 vector. The full-length cDNA sequence could be 
constructed from the sequences of the overlapping cDNA fragments. The cDNA 
sequence of the NORK gene from the M. truncatula A17 plant is shown in Figure 
5 10. 

Example 9B. 

The alignment and analvis of the genomic and cDNA sequences of the 
10 NORK gene. 

The exact posotion of the exons and introns could be determined by 
aligning the genomic and the cDNA sequences of the NORK gene. The alignment 
of the NORK gene sequences from the M. truncatula A17 plant is shown in Figure 

15 11., the detailes about the position of exons and introns are presented in Table 6. 
The proposed start codone of the NORK gene is the ATG start codone (coloured 
green) at the 1153. bp of the Nhe\ fragment (pNORK_Nhe5), while the proposed 
stop codone is the TAG stop codone (coloured violet) at the 7898. bp. In the 
putative promoter region at the 5' end of the gene can be found the TATA-box 

20 (743-55, 769-85) es CCAAT-box (675-79, 688-91 bp) sequences characteristic for 
the region. At the 3' end of the gene there are several sequences which may serve 
as transcriptional termination signals. There is no difference between the two 
sequences (genomic and cDNA) which is in accordance with the facts that the 
genomic and the cDNA is originated from the same plant, and there is only one 

25 gene in the genome. 

By comparing the sequences to the entries of databases we realized that in 
the case of M truncatula, in the so-called EST (Expressed Sequence Tags) 
databases there are partial cDNA sequences homologous to the NORK gene. 
These sequences were obtained by random sequencing, i.e. certain laboratories 
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made sequencing reactions and single runs on several thousand cDNA clones of 
different libraries without any knowledge about their function. Such sequences 
from the M. truncatula plant can be found under accession numbers AW684661, 
AW685681 and BE203249. The appearance of the cDNA sequences in this form 
5 also support the expression of the NORK gene in the roots of legumes. 

The activity of the NORK gene in the cells, i. e. it is expressed, is proven by 
the fact that we could detect NORK specific cDNA corresponding to the mRNA via 
RT-PCR. We presented as described in Example 14. that the NORK protein is 
synthetized from the mRNA. The amino acid sequence of the NORK protein can 

10 be deduced from the cDNA sequence. The amino acid sequence deduced from 
the longest Open Reading Frame (ORF) of the NORK cDNA is shown in Figure 12. 
The NORK protein consists of 924 amino acids, and it contains characteristic 
sequence motifs pointing to functions, which were revealed on the basis of 
similarity to consensus sequences in databases. The functional representation of 

15 the NORK protein and its characteristic sequence motifs are shown in Figure 13. 

From the above data obtained in this way we concluded that one part 
of the molecule is located extracellularly (extracellular part) which is followed by an 
intracellular part via a transmembrane stretch. According to our knowledge the 
extracellular part contains a so-called LRR (Leucin Rich Repeats) region 

20 (according to the homologies: between amino acids 405-476) which may be 
involved in either the direct or the indirect recognition and in either the direct or the 
indirect binding of Nod factor. Based on the similarities, the proposed intracellular 
part has kinase activity. It is known for a specialist of molecular biology that the 
kinases participate in different signal transduction pathways, they initiate or 

25 transduce the biological signals by phosphorilating proteins. The NORK protein 
may phosphorilate its substrates within the cell and thus it initiates the signal 
transduction cascade leading the development of the nodule. Based on the 
structural analysis of the protein it is possible to suppose that the NORK protein is 
capable for autophosphorilation. 
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Example 9C. 

The alignment of the cDNA sequence of M. truncatula A17 to those of the 
5 mutant (Nod") and the wild type (Nod*) M. sativa plants. 

With the help of primers designed to the NORK gene of M. truncatula 
(Figure 9.) genomic and cDNA sequences (Example 9A.) were amplified from F3 
plants carrying in homozygous configuration the Nod" alleles designated as number • 

10 1 and number 2 (NAB1241/6 and NAB701/28), as well as the Nod + allele 
designated as number 6 (NAB615/28). The nucleotide sequences of the amplified 
products were determined directly or after cloning as described in Examples 8. and 
9A. The cDNA sequence of the Nod" alleles designated as number 1 and number 
2, as well as the Nod + allele , designated as number 6 is shown in Figures 14. and 

15 15., respectively, the amino acid sequences deduced from the cDNA sequences of 
the Nod" alleles designated as number 1 and number 2, as well as the Nod + allele 
designated as number 6 can be seen in Figures 16. and 17., respectively. The 
sequence analysis revealed that the sequences of the NORK alleles designated as 
number 1 and number 2 are identical, that is why only one sequence is presented. 

20 The alignment of the cDNA sequences of the Nod' and Nod + alleles from M. 

sativa to the cDNA sequence of the M. truncatula NORK gene is shown in Figure 
18. One can see from the comparison of the sequences that there are a number of 
differences between the sequences of M. sativa and that of M. truncatula, and a 
few differences - in most cases causing no amino acid changes - can be found 

25 between the wild type and mutant alleles of M. sativa. The most dramatic change 
in the structure of the Nod" allele carrying the nni mutation can be found at the 1 1 . 
basepair in the 13. exon (E13) of the gene. The mutation in the triplet coding for 
tirosine resulted in a STOP codone in the sequence originated from the Nod" plant. 
As a result of this mutation, the synthesis of the protein is terminated earlier, even 
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before the synthesis of the putative active site of the kinase, and it can be 
supposed that the produced shorter protein cannot fulfil its function in the signal 
transduction.. 

The alignment of the amino acid sequences of the NORK proteins of the M. 
5 truncatula A17 es a M. sativa plants is shown in Figure 19. The comparison of the 
sequences reveals that the sequences of M. sativa differ at a number of positions 
from that of M. truncatula A17. The amino acid differences between the Nod + 
alleles of M. truncatula A17 and M sativa are probably the results of so-called 
neutral mutations which do not change the function. The comparison of the amino 

10 acid sequnces of the Nod" and the Nod* NORK alleles reveals that the most 
dramatic change in the structure of the NORK protein in the M. sativa mutant MN- 
1 008 is that the protein consists of only 709 amino acids because there is a STOP 
codon in place of the tyrosine codon in position 710 which causes the termination 
of protein synthesis (see above). There are more than one amino acid changes in 

15 the protein originated from the Nod" allele compared to that of the Nod + allele which 
might result in the loss of function. It cannot be decided that in which order the 
mutations in the nn 1 gene arose and which one was the first that in homozygous 
configuration resulted in the fail of nodule formation, i.e. in Nod" phenotype. 

20 Example 10. 

The number of NORK genes in the Medicago genome and its distribution in 
the plant kingdom 

25 The genetic mapping is a suitable tool to determine the copy number of the 

NORK gene. For this purpose, using the putative extracellular part of the NORK 
gene (this part is specific, there is no homologous sequence in the databasis) as a 
hybridization probe, hybridization experiments were carried out as described in 
Example 3. with the filters carrying the DNA of the individuals from the diploid and 
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tetraploid mapping populations of M. sativa. The 1695 bp length probe, which is 
called as „NORK specific" probe, was prepared by PCR using primers PMRKU4 es 
Pr_RKD2X (Figure 9.) and a cDNA template originated from the M. truncatula A17 
plant as described in Example 9A. The analysis of the autoradiograms showed that 
5 the segregating hybridizing fragments identify a single locus (Figures 20. and 21 .), 
which means that there is only one copy of the NORK gene in the genome, i.e. the 
NORK gene is a single-copy gene. 

The above described probe was used for the hybridization to the genomic 
DMA of different plants. With the help of the QIAGEN Plant DNA Isolation Kit 

10 genomic DNA was isolated as described in Example 2. from different legume 
plants: M. truncatula A17 es A20, Pisum sativum cv. Frisson, Sesbania rostrata, 
Cassia emerginata, Desmodium sp M Vicia sativa, Melilotus alba, Trifolium 
pratense, Trifolium incarnatum, Vigna unguiculata, Macroptilium atropurpureum, 
Vigna radiata, Glycine max, Lotus corniculatus, Lotus japonicus cv. Gifu, and from 

15 non-legume ones: Nicotiana tabacum cv. Small Havanna SR1, and rice. 15-20 ug 
of total DNA was digested by restriction enzymes EcoRI, EcoRV, Dral or HindUl, 
the DNA fragments were separated by electrophoresis and bound to Hybond N+ 
filters (see. Example 3). The autoradiograms obtained from the hybridizations 
performed at 55 °C using the „NORK specific" probe (Example 10.) can be seen in 

20 Figures 22., 23., 24., 25., 26., 27. One can see on the autoradiograms that there is 
only one or two strongly hybridizing fragments in most legumes which means that 
in most of the legumes the NORK gene is a low copy number gene, only one or 
few genes are present in the genome. The autoradiograms obtained after the 
hybridization experiments with non-legume plants show that in these species the 

25 presence of the NORK gene cannot be detected under our hybridization 
conditions. 

Example 1 1 . 
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Determination of the NORK cDNA sequence from other legumes. 

The nucleotide sequences of the NORK gene weres determined from a 
single individual of two other legume species, pea (Pisum sativum) and vetch 
5 (V7c/a villosa). The sequencing of the NORK cDNA from Pisum sativum cv. Frisson 
and Vicia villosa S-1 plants was carried out using the primers designed for the 
NORK gene of Medicago truncatula (Figure 9.) as described in Example 9A. Both 
Pisum sativum cv. Frisson and V7c/a villosa S-1 has Nod + phenotype, similarly to 
Medicago truncatula, diploid, autogamous plants, which means that the two alleles 

10 of their NORK gene has identical sequence. The nucleotide sequence of the 
amplified products was determined directly or after cloning as described in 
Examples 8. and 9A. Figures 28. and 29. show the NORK cDNA sequences of 
Pisum sativum cv. Frisson and V7'c/a villosa S-1 , respectively, constructed from the 
sequences of overlapping fragments. Figures 30. and 31. show the amino acid 

15 sequences deduced from the cDNA sequences of these plants. 

The cDNA sequences determined by us from Pisum sativum and Vicia 
villosa plants as well as the cDNA sequence from the Lotus japonicus Miyakojima 
MG-20 plant deposited in the NCBI database under the Accession number 
AV410167 proves that the NORK gene is transcribed in these plants as well, , i.e. it 

20 is active. It is noteworthy to mention that in Vicia villosa there are either two alleles 
or two copies of the NORK gene (see Figures 29. and 31 .). It can be concluded as 
well that th e nucleotide sequence (bfetNORK gene in plants investigated by us is 
so homologous to that of M. truncatula that the primers designed for the NORK 
gene of M. truncatula could be used for the amplification of different parts of the 

25 NORK gene from alfalfa, pea and vetch. 

Example 12. 
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Complementation of the Nod' mutant M. sativa MN-1008 plant with the 
NORK gene. 

In order to prove that the identified mutation in the NORK gene is 
5 responsible for the Nod" mutant phenotype, the 8.5 kb NheY fragment carrying 
NORK gene (Table 5., Figures 7. and 8.) was cloned into plant transformation 
vectors (see Table 7.)- The NORK gene was cloned in the so-called T-DNA part of 
the Agrobacterium-based plant transformation vectors. Agrobacterium species that 
entered the tissues of plants transfer the T-DNA from their plasmid into the plant 

10 cells (T-DNA transfer), which is followed by the integration of T-DNA (54). The 
random integration of T-DNA into different regions of the plant genome makes 
possible the stable maintenance and inheritance, as well as the expression of the 
genes of the T-DNA. There are two basic methods to introduce T-DNA based 
plasmids into plants: the Agrobacterium tumefaciens and the Agrobacterium 

15 rhizogenes mediated transfer. Using A. tumefaciens strains, transformed calli are 
obtained first, from which somatic embryos have to be induced which have to be 
regenerated to raise transgenic plants. This procedure can be carried out only by 
using so-called embryogenic plants. Using A. rhizogenes strains for infection, hairy 
roots grow on the infected part of the plant, and as a result a chimeric organism is 

20 formed in which only the hairy roots are transformed. 

Since preliminary results showed that the plant (MN-1008) carrying the nni 
mutation is not embryogenic under standard conditions, i.e. plants cannot be 
regenerated from the calli transformed by the A. tumefaciens mediated method, 
two strategies were chosen for the genetic complementation. 

25 (i) Using the Agrobacterium rhizogenes transformation method, when the 

ability for the symbiotic nodule formation of the transformed hairy roots can be 
investigated. Transformed roots of most legume species are able to form 
nodules(55, 56), that is why, in the case of complementation, the formation of 
nodules might be expected on the roots of the mutant plants. 
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(ii) The non-embryogenic MN-1008 plant was crossed as described in the 
Example 1. with an individual of an embryogenic line (Regen SY) which was 
provided by Dr. Deborah A. Samac, University of Minnesota, MN, USA (57). After 
the selection and self-pollination of the embryogenic F1 individuals, the symbiotic 
5 phenotype of the individuals of the F2 population was determined as described in 
the Example 1., and two plants from the Nod" progeny (F2RN28/4, F2RN28/5) as 
well as two embryogenic F1 plants (F1RN28 es F1RN41) was used for the A 
tumefaciens mediated transformation experiments. 

10 Example 1.2A. 

The cloning of the NORK gene into plant transformation vectors 

The fragment carrying the NORK gene was cloned into several plant 
15 transformation vectors which have different copy number and provide different 
DNA-environment in the T-DNA that might affect the expression of the transformed 
gene. 

The 8.5 kb Nhel fragment carrying the NORK gene was cloned into the 
different plant transformation vectors from plasmid pBRC1660 which was 

20 constructed as follows: From the G20P5 clone (see Table 5.) constructed as 
described in Example 6., the insert was cut by the restriction endonucleases Pael 
and Sad and was ligated into the pOK12 (49) vector which was digested with the 
same enzymes and dephosphorilated by the CIP enzyme. From the obtained clone 
designated as pAT688 the insert was cleaved by the restriction endonucleases 

25 C/al es Sad and it was ligated into the vector pBlueScript II SK (Stratagene, 1 101 1 
North Torrey Pines Road, La Jolla, CA, USA). From the clone pBRC1660 obtained 
as described above (maintained in E. coli DH5a strain under the code BRC1660) 
the NORK gene can be deliberated by several enzymes. The pOK12 vector was 
provided by Dr. J. Messing (Waksman Institute, Piscataway, NJ, USA). 
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The 1 1 .8 kb Clal fragment harbouring the NORK gene with a 2.2 kb longer 
5' (promoter) sequence was cloned into the different plant transformation vectors 
from plasmid pBRC1690 which was constructed as follows: The BAC clone 67A11 
shown in Figure 5. was digested by Clal enzyme, then the fragment was re- 
5 isolated from agarose gel as described in the Example 3. and was ligated into 
pBlueScript II SK vector which was digested by the same restriction endonuclease 
and dephosphorilated by the CIP enzyme. 

The pPK459 vector which was used for the construction of the pBRC1678 
clone was developed as follows: The Hind 1 1 \-EcoRl fragment from thepolylinker of 
10 the pSL301 plasmid (59) was cloned between the H/ndlll es EcoRI sites of the 
pGA471 vector described by An et al. (58). The pGA471 vector (58) and the 
pSL301 vector (59) were kindly provided by G. An (Washington State University, 
Pullman, WA, USA) and J. Brosius (Mt. Sinai School of Medicine, New York, NY, ; 
USA), respectively. 

15 The pPR97 vector (60) used for the construction of clones pBRC1666 and 

pBRC1701 was provided by Laszlo Szabados (MTA, BRC, Szeged). 

The pAT680 vector used for the construction of the pBRC1667 clone was 
constructed as follows: the T-DNA part of the pGREEN II vector described by 
Hellens et al. (61) was digested by Hpal enzyme and treated by the CIP enzyme 

20 and was ligated to an EcoRV fragment carrying the nopaline synthase gene 
promoter (nos) driven neomycin phosphotransferase II gene providing kanamycin 
resistance (nos-KmR casette), and the obtained clone was deposited in the strain 
collection under the name pAT678. After removing by EcoRV digestion from the 
carrier plasmid, the intron-containing uidA gene (62) coding for the 0- 

25 glucuronodase gene and driven by 35S promoter of the cauliflower mosaic virus 
(35S-GUS-INT casette; 61) was built into the Stul digested CIP treated pAT678 
vector. The T-DNA containing Bglll fragment was cloned from the obtained 
pAT679 plasmid into the BamHI site of the broad host range vector pAT672 which 
was constructed from the replication origo of plasmid pBBR1MCS-4 (63) and the 
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kanamycin resistance gene of plasmid pK19 (64) as follows: The kanamycin 
resistance gene from plasmid pK19 was amplified as described in Example 4. 
using primers KMU (5-GGCGATCGATAGACTGGGCGG) and KMD (5'- 
TCGTGATGGCAGGTTGGGCG), then the amplified fragment was cleaved by Clal 
5 and reisolated from agarose gel as described in Example 3. The isolated fragment 
was cloned into the Ehel and Clal digested pBBR1MCS-4 vector. The original 
restriction enzyme cutting sites from the resulted pAT671 vector were removed in 
two. steps as follows: The DNA was digested by BamHI and Bsp119l, then BamHI 
and C/al enzymes and the digested DMAs were treated in the presence of 0,05,: 

10 mM dNTP with the Klenow fragment of DNA polymerase I as described in Example 
4. and ligated. The overhanging ends of the DNA resulted after the BamHI and 
Bsp1 191, as well as the BamH\ and C/al digestions were filled: in by the Klenow 
fragment which restored the recognition site (GGATCC) of BamHI. Plasmids pK19 
(64) and pBBR1MCS-4 (63) were kindly provided by D. Pridmore (Ciba-Geigy AG, 

15 Basel, Switzerland) and E. Kovach (Louisiana State University, Shreveport, LA, 
USA), respectively. Vector pGREEN II and casettes nos-KmR and 35S-GUS-INT 
can be obtained from the John Innes Centre, Norwich, England (detailed 
information: www.pgreen.ac.uk) . 

These cloning vectors (pPK459, pAT680, pPR97) were digested by 

20 restriction enzymes shown in the third column of Table 7., purified and 
dephosphorilated by the CIP enzyme as described in Example 4. After a second 
purification step 1 jug of vector DNA was ligated with 1 ^g of the insert carrying the 
NORK gene. The insert carrying the NORK gene was deliberated from the plasmid 
pBRC1660 by enzymes shown in the third column of Table 7. then isolated from 

25 gel as described in Example 3. The ligated DNAs were transformed into E. coli 
strain DH5a, then after the checking of the transformant colonies (see Example 
6A) the E. coli strains carrying the clones were deposited under the codes 
BRC1666, BRC1667, BRC1678 and BRC1701 (Table 7.). 
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The plasmids carrying the NORK gene (pBRC1666, pBRC1667, pBRC1678, 
pBRC1701) were purified from the E. coli strains and were transformed into the 
different Agrobacterium rhizogenes es Agrobacterium tumefaciens strains shown in 
Table 8. The transformations were carried out by electroporation as follows. 
5 Bacteria grown on solid LB medium containing 100 ng/ml of rifampicin antibiotics 
were inoculated into 5 ml of YENB (65) medium containing 100 ug/ml of rifampicin 
and were incubated overnight at 30 °C with good aeration. 200 ml of YENB 
medium was inoculated with this starter culture. The bacterium culture was grown 
with intensive aeration to late logarithmic phase (optical density at 600 nm, 

10 OD 6 oo=0.6-0.8), then the cells were collected by centrifugation. The bacteria were 
washed at 0°C two times with 100-100 ml of sterile distilled water and two times 
with 10 ml of 1mM HEPES (pH=7) containing 10% of glicerol. Bacterial cells were 
resuspended at 0°G in 2-4 ml of sterile 1mM HEPES (pH=7) containing 10% of 
glicerol, and these electrocompetent cells were frozen in liquid nitrogen as 100 ^l 

15 aliquots and stored at -80 °C till usage: 0.1-1 ng of plasmid DNA was added to the 
electrocompetent cells on ice, then the DNA was introduced into the cells at 1 .25 
kV/cm field strength with the help of to E.coli Pulser instrument (BioRad) according 
to the supplier's instructions. Immediatly after the electroporation, 1 ml of LB 
medium was added to the cells, then after an incubation at 30 °C for 120 minutes 

20 the cells were plated onto LB medium containing 100 fag/ml of rifampicin and 
antibiotics shown in Table 7. 

Example 12B. 

25 Transformation using Agrobacterium rhizogenes strains 

The A. rhizogenes strains (BRC1673, BRC1675, BRC1679; Table 7.) 
carrying the NORK gene on plasmid can be introduced into the tissues of the Nod" 
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mutant M. sativa plants by two ways: 1.) through the sectioned surface of 
seedlings* radicle and 2.) through the sectioned surface of shoots of mature plants. 

Transformation via cutting the radicles of seedlings: The seeds of the MN- 
1008 and the Nod" plants (NAB701, NAB809, NAB897, NAB968, NAB2792) 
5 carrying nn-i mutation - which were identified and maintained as described in 
Example 1. - were sterilized and germinated on 1% water agar as desribed in 
Example 1. The rootlets of the 3-4 day old seedling were cut 5-10 mm below 
cotyledon and the wounded roots were dipped into a lawn of A. rhizogenes grown 
on solid LB medium. The A: rhizogenes strains (BRC1673; BRC1675, BRC1679) 

10 were streaked from the strain collection stored at -80 °C onto LB medium 
containing 100 \xgfm\ of rifampicin and antibiotics shown in Table 7. and were 
incubated at 30 P C for 3 days. The plants treated with the Agrobacterium strains 
were placed into squared (10x10 cm) Petri-dishes containing TM-1 medium (66) 
and were grown at 24°C with 16 hour light period and 10000 Lux light intenstity. 

15 After 5 days roots were appearing at the cut surface, and part of them were hairy 
root, i.e. transformed root. 

2. Transformation through the sectioned shoots of mature plants: The young 
shoots of the MN-1008 and the Nod" plants (NAB701, NAB809, NAB897, NAB968, 
NAB2792) identified and maintained as described in Example 1 . that carry the nn^ 

20 mutation in homozygous configuration were cut from the plants and placed 
immediately into tap water. The sectioned surfaces of the shoots were dipped into 
melted parrafine to avoid the entering of the chemicals used for sterilization into 
the vascular tissues. Then the shoots were sterilized as follows: they were washed 
in 70% ethanol for 5 seconds then immediately in 20% bleech containing 0.05 % 

25 Tween 20 for 90 seconds. After 3 washing in sterile destilled water the shoots were 
cut into pieces of 3-5 cm. After dipping the root-proximal surface into 
Agrobacterium rhizogenes lawn grown as described in the previous paragraph the 
distal end of the plants were stabbed into TM-1 medium and were incubated at 
22°C with 16 hour light period and 10000 Lux light intenstity. From the 20. day 
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roots were appearing at the place of the wounding and some of them were 
transformed 

Transformed roots could be identified by the so-called GUS-staining (67) 
because the T-DNA carries the uidA gene coding for the ^-glucuronidase enzyme. 
5 The roots were stained fro the GUS activity as follows: root pieces were incubated 
at 37°C in dark for 12-24 hours in a buffer containing 100 mM TRIS (pH=7.4), 50 
mM NaCI, 2 mM spermidine, 2 mM K 3 Fe(CN) 6 , and 2 mM X-GlcU (5-bromo-4- 
chloro-3-indol p-D-glucuronic acid) dye. The ^-glucuronidase enzyme expressed in 
the transformed roots cleaves the dye and as a result a blue product precipitated in 

10 the cells is formed which can be detected both by eyes and by microscope. The A. 
rbizogenes strains used for transformation and the transformation efficiency is 
shown in Table 9. After the transformation 4 transformed roots were obtained, 
however, no nodule formed in the presence of compatible S. meliloti. The reason 
for this results might be that the T-DNA of the Ri plasmid harbours genes coding 

15 for enzymes involved in plant hormone biosynthesis and the expression and 
biological effect of these genes in the plant cells inhibit nodule formation (A. 
Kondorosi, personal communication). 

Example 1 2C. 

20 

Transformation using Aarobacterium tumefaciens strains 

The A. tumefaciens transformation was carried out as follows: Twenty 
pieces of healthy, dark-green leaves were cut from the alfalfa plant to be 
25 transformed and placed immediately into Petri dishes containing tap-water. The 
leaves were sterilized as follows: after washing with 70 % ethanol for 5 seconds, 
they were placed immediately into a 20 % of bleech containing 0.05 % of Tween 
20, and washed for 90 seconds. This was followed by three washing steps in 
sterile destilled water. The edges and the central vascular tissues of the sterilized 
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leaves were removed and the two pieces were split in two. These leaf pieces - with 
ail the edges cut - were placed into SHO medium containing A tumefaciens which 
was prepared as follows: The A tumefaciens strains (BRC1677, BRC1680, 
BRC/1681, BRC1707; Table 8.) stored at -80 °C were streaked on solid medium 
5 containing 100 ^ig/ml of rifampicin, as well as the proper antibiotics shown in Figure 
7. and incubated at 30°C. A single colony was inoculated by loop into 3 ml of liquid 
YEP medium (in 20 ml tube) and the bacteria were grown by rolling (to provide 
aeration) to the stacioner phase (24 ora). The bacteria were collected by 
cehtrifugation as described earlier, the supernatant was poured away, and the 

10 bacteria were suspended in 12 ml of SHO medium. After keeping in the 
Agrobacterium suspension for 20 minutes, the leaves were removed and the 
excess of suspension was blotted by sterile blotting paper, then the leaf pieces 
were placed onto B5h in Petri dishes (maximum 16 leaf pieces per plate). The 
plates were sealed by air-permeable bands and placed to 24°C and 16 hour light 

15 period with 10000 Lux light intenstity. After 1 week the leaves were removed from 
the medium, washed 3 times with sterile distilled water and placed onto B5hKmCb 
medium then were incubated as described above. After 3 weeks the leaf pieces 
forming calli were placed onto BShKmCb medium and were further incubated as 
described above. The appearing dark green embryos were transferred onto 

20 MMSCb medium and were further incubated as described above. The plantlets 
forming roots and shoots were transferred into jars containing MMSCb medium 
where the putative transformants were raised. The developed plants were 
propagated in vitro as follows: a piece of the shoot containing 3-4 pair of leaves 
was cut under sterile conditions, placed into MMSCb medium in a jar and 

25 incubated as described above. 

After strengthening of the shoots and roots of the propagated plants, DNA 
was isolated from the leaves of the plants as described in Example 2., then to 
detect the transformation events PCR reactions were carried out as described in 
the Example 3. by using primers specific for the nptll gene providing resistance 
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against kanamycin (NPT-U: 5'-ACCCAGCCGGCCACAGTCG-3\ NPT-D: 5'- 
GGGCGCCCGGTTCTTTTTG-3') and for the uidA gene encoding the J3- 
glucuronidase enzyme (GUS-U: 5'-TTATGCGGGCAACGTCTGGTAT-3\ GUS-D: 
5'-AGTCCCGCTAGTGCCTTGTCC-3') that are within the T-DNA. 
5 After transfering to nitrogen-free Gibson medium, the plants were infected 

with S. meliloti strain 41 and the process of nodulation was followed. Transformed 
derivatives of RN28 and RN41 were self-pollinated then germinated as described 
in Example 1. and we checked their nodule formation ability. 

The results of the transformation experiments by using the A. tumefaciens 
10 strains are summarized in Table 10. 
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The composition of the YEP medium: 
10 g/l Protease Peptone (Difco Laboratories) 
10 g/l Bacto yeast extract (Difco Laboratories) 
5 g/l NaCI 

5 Sterilization: 121 C°, 20 minutes. 

The composition of the LB medium: 
10 g/l Bacto-tryptone (Difco Laboratories) 
5 g/l Bacto yeast extract (Difco Laboratories) 
10 5 g/l NaCI 

pH=7.5 

Sterilization: 121 C°, 20 minutes. 

The composition of the YENB medium: 
15 8 g/l Bact Nutrient Broth (Difco Laboratories) 

7.5 g/l Bacto yeast extract (Difco Laboratories) 
Sterilization: 121 C°, 20 minutes. 

The composition of the SHO medium: 
20 Schenk and Hildebrant salt: 



KN0 3 2500 mg/l 

MgS0 4 -7H 2 0 400 mg/l 

NH 4 H 2 P0 4 300 mg/l 

CaCI 2 -2H 2 0 200 mg/l 

25 MnS0 4 H 2 0 10 mg/l 

H3BO3 5 mg/l 

ZnS0 4 -7H 2 0 1 mg/l 

Kl 1 mg/l 

CuS0 4 -5H 2 0 0.2 mg/l 
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FeS0 4 -7H 2 0 



NaMo0 4 -2H 2 0 



CoCI 2 -6H 2 0 



0.1 mg/l 
0.1 mg/l 
15 mg/l 



Na 2 EDTA 



20 mg/l 



Schenk and Hildebrant vitamine solution: 



ThiamineHCI 



myo-inositole 



Nicotinic acid 



PyridoxineHCI 



1000 mg/l 
5 mg/l 
5 mg/l 
0.5 mg/l 



30 g/l saccharose 

0.5 g/l MES (Sigma) 

pH= 7.5 (adjusted with KOH) 

Sterilization: 121 C°, 20 minutes. 

The composition of the B5h medium: 
1 liter of B5 soulution 
30 ml of B5 amino acid stock solution 
1 ml of B5 hormone stock solution 

The composition of the B5 solution: 
Gamborgs" B5 salts: 

KNO3 2500 mg/l 

MgS0 4 -7H 2 0 250 mg/l 

(NH 4 ) 2 S0 4 134 mg/l 

CaCI 2 -2H 2 0 150 mg/l 

NaH 2 P0 4 H 2 0 150 mg/l 

MnS0 4 -H 2 0 13.5 mg/l 

H3BO3 3 mg/l 
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ZnS0 4 -7H 2 0 2 mg/l 

Kl 0.75 mg/l 

CuS0 4 -5H 2 0 0.025 mg/l 

NaMo0 4 -2H 2 0 0.25 mg/l 

5 CoCI 2 -6H 2 0 0.025 mg/l 

FeS0 4 -7H 2 0 27.8 mg/l 

Na 2 EDTA 37.3 mg/l 

Gamborgs' B5 vitamine solution: 

10 myo-inositol 100 mg/l 

Nicotinic acid 1 mg/l 

ThiamineHCI 10 mg/l 

PyridoxineHCI 1 mg/l 



15 0.5 g/l proline 

40 g/l saccharose 

pH= 5.7 (adjusted with KOH) 

8 g/l Phytoagar (GIBCO BRL) 

20 Sterilization: 121 C°, 20 minutes. 

Before plating sterile amino acid and hormone stock solutions are added. 

B5h amino acid stock solution: 

25 

6.65 g/250 ml L-glutamine 
0.83 g/250 ml serine 
0.004 g/ 250 ml adenine 
0.083 g/250 ml glutathione 
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Filter sterilization. 30 ml of amino acid stock solution is added to 1 I of B5h 
medium before plating 

5 B5h hormone stock solution: 

1 mg/ml 2, 4-D-(2,4-dichlore-phenoxy-acetic acid, GIBCO BRL) 
0.1 mg/ml kinetin (GIBCO BRL) 

Filter sterilization. 1 ml of homone stock solution is added to 1 I of B5h 
10 medium before plating 

The composition of the B5hKmCb medium: 

B5h medium containing 25 mg/l of kanamycine and 500 mg/l of 
carbenicilline. 

15 

The composition of the MMSCb medium: 

4.3 g/l Murashige and Skoog salt (GIBCO BRL) 

1 ml/1 1 0OOx Nitsch and Nitsch vitamine stock solution: 



myo-inositol 1 00 mg/ml 

20 glycine 2 mg/ml 

Nicotinic acid 5 mg/ml 

PyridoxineHCI 0.5 mg/l 

Thiamine HCI 0.5 mg/l 

Folic acid 5 mg/l 

25 Biotine 0.05 mg/l 



30 g/l saccharose 

pH= 5.7 (adjusted with KOH) 

8 g/l Phytoagar (GIBCO BRL) 
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Sterilization: 121 C°, 20 minutes. 

500 mg/l of carbenicilline is added after sterilization and before plating. 
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Example 13. 

The introduction of the NORK gene of M. truncatula into other (leguminous 
and non-leguminous) plants 

5 

The Agrobacterium tumefaciens strains described in Example 12. 
(BRC1677, BRC1680) carrying the NORK gene of M. truncatula were used for the 
introduction of the NORK gene into the legume plant vetch (V7c/*a villosa) and the 
non-legume tobacco (Nicotiana tabacum cv. Small Havanna SR1) plant. The 

10 transformations were performed as described in Example 12C with the 
modification that in the case of Vicia 2 mg/l of 2,4-D and 0.2 mg/l of kinetin was 
used, while the pieces of the tobacco leaves were placed onto MS medium (68) 
containing 1 mg/l of benzyl-aminopurine (BAP) and 0.1 mg/l of naphtyl-acetic acid 
(NAA), as well as antibiotics (1 00 mg/l of kanamycin and 500 mg/l of carbenicillin). 

15 After the transformations six N. tabacum and eight V. villosa plants 

originated from independent transformation events were regenerated (Table 1 1 .). 
Total DNA from the transformed plants was isolated and PCR amplification using 
the Pr-RKU4-~Pr-RKD5 primerpair (Figure 9.) was carried out as described in 
Examples 2. and 9., respectively. The sequence of amplified products (exonl- 

20 exon2 of the NORK gene) were determined as described in Example 8. The 
sequencing results revealed that all sequences determined from the transformed 
plants were identical to the DNA sequence of the M. truncatula NORK gene, which 
means that the transgenic plants contains the NORK gene of M. truncatula. 
Proteins from the roots of transformed tobacco plants were isolated as described 

25 in Example 14. and the presence of the NORK protein in the transformants was 
demonstrated by dot blot hybridization. These results show that the NORK gene is 
expressed in N. tabacum. The results of the transformation experiments and the 
characterization of the transformants are shown in Figure 32. 
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The composition of the MS medium: 
Murashige and Skoog salts: 



KN0 3 


1900 mg/l 


MgS0 4 -7H 2 0 


70 mg/l 


NH4NO3 


650 mg/l 


CaCI 2 2H 2 0 


40 mg/l 


KH 2 P0 4 H 2 0 


70 mg/l 


MnS0 4 -H 2 0 


22.3 mg/l 


H3BO3 


8.2 mg/l 


ZnS0 4 -7H 2 0 


8.6 mg/l 


Kl 


0.83 mg/l 


CuS0 4 -5H 2 0 


0.025 mg/l 


NaMo0 4 -2H 2 0 


0.25 mg/l 


CoCI 2 6H 2 0 


0.025 mg/l 


FeS0 4 -7H 2 0 


27.8 mg/l 


Na 2 EDTA 


37.3 mg/l 


Murashige and Skoog vitamine solution: 




myo-inositol 


1 00 mg/l 


Nicotinic acid 


0.5 mg/l 


ThiamineHCI 


0.1 mg/l 


PyridoxineHCI 


0.1 mg/l 


Glycine 


2 g/l 


Saccharose 


20 g/l 


pH= 5.7 (adjusted with KOH) 




Phytoagar (GIBCO BRL) 8 g/l 





Sterilization: 121 C°, 20 minutes. 
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Example 14 

Immunological detection of the NORK protein 

5 

In order to prove the presence of NORK protein in the cells of alfalfa, 
and in the transformed tobacco, a non-legume plant, two exons (2nd and 3rd 
exons) of the N-terminal (5') part of the NORK gene were expressed in 
Escherichia coli. Antibodies were produced against the purified proteins and the 
10 NORK protein was detected by immunological method (dot blot analysis). 

Example 14A. 

Cloning of the DNA fragment coding the N-terminal part of M. truncatula 
15 A17 NORK protein into the expression vector pTrcHisA 

The pTrcHis (A, B and C) (Invitrogen BV Groningen, The Netherlands) 
expression vectors are made possible to express recombinant protein containing 
six histidine (His-tag) and a recognition site of enterokinase enzyme in fusion. The 

20 His-tag allows us to purify the recombinant protein in a single-step purification 
procedure (recommended by the suppliers). For this reasons this system was 
used to express the extracellular part of the NORK protein. 

A DNA fragment of the M. truncatula NORK cDNA sequence described in 
Example 9 and showed in Figure 10 was produced by PCR amplification using the 

25 RKEx2U and Pst-A3 primers (Figure 9.) as it is described in Example 9A. The 957 
bp long amplified cDNA fragment coded 319 amino acids (amino acids from 33 to 
352) of the supposed extracellular part of the NORK protein. pTrcHisA vector was 
used for cloning to fuse the sequence of the NORK protein portion to be 
expressed and the His-tag in frame. The pTrcHisA vector and the amplified cDNA 

30 fragment were digested by BamHI and Pstl restriction enzymes (Fermentas AB) 
according the supplier's instructions. The fragment and the vector prepared in this 
way were ligated (construct pTrcHisA::dNORK) as it is described in Example 4. 
The ligation is transformed into E. coli TOP10 competent cells detailed also in 
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Example 4. After transformation and plating, plasmid DNA was isolated from the 
ampicillin resistant colonies using the QIAGEN Plasmid Isolation Kit (see Example 
4.). The frame and the fidelity of the sequence of the insert were checked by 
sequence analysis (see Example 8.). The amplified and correctly inserted clone 
5 was termed TOP1 0(pTrcHisA::dNORK). 

Example 14B 

Expression of the extracellular part of the NORK protein in E. coli 

1Q 

The recombinant protein (rNORK) produced by the Xpress System 
Protein Expression system (construct pTrcHisA::dNORK) was purified by the 
Xpress System Protein Purification (Invitrogen BV Groningen, The Netherlands) 
according to the manufacturer's instructions. Overnight culture of the clone 

15 TOP10(pTrcHisA::dNORK) (see Example 14A.) grown in 2 ml of LB medium 
containing ampicillin in 50 jj,g/ml concentrations was diluted 250 times in 50 ml of 
the same medium. The culture was grown at 37°C with vigorous shaking to an 
OD 6 oo=0.6. Then the expression of protein was induced by adding 0.5 ml IPTG 
solution (100 mM) and the culture was shaked for 5 more hours. Bacterial 

20 suspension was collected by centrifugation (centrifuge: Sorvall RC5, GSA rotor 
with the proper tubes, 4°C, 10 min, 8k rpm) than the bacterial pellet was 
resuspended in 10 ml of buffer (recommended by the kit: 20 mM NaH 2 P0 4 , 500 
mM NaCI; pH 7.8). Disruption of bacteria was made by sonication 3 times for 10 
sec using the smallest head of MSE Ultrasonic Disintegrator. The supernatant of 

25 the disrupted and collected bacterial suspension was bound to affinity column 
through the six histidine amino acid residue present in the fusion protein. Bacterial 
proteins were removed by three washing steps using solutions with decreasing pH 
(20 mM NaH 2 P0 4 , 500 mM NaCI; pH 7.8; 6.0 and 5.5). The recombinant protein 
was eluted from the column by 5 ml elution buffer of the kit (20 mM NaH 2 P0 4 , 500 

30 mM NaCI; pH 4.0) having 1 mg/ml protein concentration. The eluted fusion 
protein was used directly to produce antibodies in rabbits. 
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Example 14C 

Producing polyclonal antibodies against the extracellular part of the NORK 
protein 

5 

Three, 4 months-old rabbits were injected by native protein in 1 ml 
final volume dissolved in PBS (69) and purified as described in Example 14B, 
respectively. Approximately 100 \xg of total purified protein (antigen) was injected 
subcutan into four different sites of the neck. For the primary injection 50% 
10 complete Freunds adjuvant (GFA) (cat. no: F-5881 Sigma-Aldrich Kft. Budapest) 
and 50% recombinant protein dissolved in PBS were used. Before the primary 
injection the rabbits were bled and the gained sera were used later as a negative 
(isotype) control. 

After the fourth and seventh weeks of the primary injection boost 
15 injection was performed to achieve a heigher state of immunity 
(hyperimmunization). For the second injection 10% CFA + 40% incomplete 
Freunds adjuvant (I FA) (cat. no: F-5506 Sigma-Aldrich Kft. Budapest) and 50% 
recombinant protein dissolved in PBS were used. Ten days later the antibody 
production was checked. Serum was prepared from 1-5 ml blood obtained from 
20 ear's vein. The obtained blood was kept at room temperature for 4 hours and after 
coagulation it was centrifuged (2700g, 4°C, 10 min). A part of the supernatant 
(serum) was kept at 4°C untill utilization and the rest was stored at -20°C in 100 uJ 
aliquots. The antibody content of the antisera was checked by ELISA (Enzyme- 
Linked Immunosorbent Assay) and dot blot analysis. The rabbits were kept alive 
25 and were injected again on the 7th week after the primary injection. For the third 
injection 50% I FA and 50% recombinant protein dissolved in PBS were used. On 
the 10th day after the third injection, the rabbits were bled and the samples were 
checked again by ELISA and dot blot analysis. Then 80 ml blood was taken from 
the rabbits. 

30 
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Example 14D 
ELISA 

5 500 jag recombinant protein dissolved in PBS per well was plated into 96- 

well microtiter plate and incubated at 37°C for 2 hours. Then the plate was 
washed three times by washing buffer (0.05% Tween-20, 0.5 M NaCI in PBS) and 
was blocked by 0.5% gelatine (1 hr, 37°C). After three washes the plate was 
incubated at 37°C for 1 hour in the presence of 100 and 1000-fold diluted rabbit 

10 anti sera in 50\x\ washing buffer. After three repeated washes the plate was 
incubated (1 hr, 37°C) with the second antibodies diluted in 50|j.I washing buffer 
for 10000-fold. Horseradish peroxidase conjugated anti-rabbit IgG antibody (cat. 
No: A-0545 Sigma-Aldrich Kft. Budapest) was used as a secondary antibody. After 
four washes the plate was developed by 0.5 mg/ml OPD substrate (cat. no: P- 

15 5412 Sigma-Aldrich Kft. Budapest). OPD was dissolved in PBS containing 0.1 M 
citrate than 5 yd 30% H2O2 was added to it. After blocking the reaction by 4 M 
sulphuric acid (50 nl/well), the plates were read with a microplate ELISA Reader 
(Labsystem Multiscan Biochromatic) at 492 nm. The most specific antibody 
selected based on the results of ELISA test was used in 1000-fold dilution to 

20 investigate the extracts purified from roots of different plants. 

Example 14E. 

Protein purification from the roots of mutant (Nod") and wild type (Nod*) 
25 Medicaao sativa and transformed and control Nicotiana tabacum plants 

Purification of proteins isolated from the roots of MN-1008 (Nod") and 
NAB814 (Nod + ) Medicago sativa as well as control (non-transformed) and 
transformed Nicotiana tabacum plants (see Example 13) were performed by the 
30 slightly modified methods of Huang and Berry (70) as well as Cheong and Hahn 
(59). 1 g fresh roots were homogenized in 1 ml homogenisation buffer (25 mM 
Tris-HCI pH 7.0, 30 mM MgCI 2 , 2 mM dithiotreitol (DTT), 0.2 mM 
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phenylmethylsulfonyl fluoride (PMSF)) in the presence of quartz sand on ice. The 
homogenate was filtered through four layers of cheesecloth (commercially 
available) and centrifuged at 6000g (4°C, 15 min). Supernatant and the crude 
membrane fraction obtained by centrifugation of the supernatant (20000g, 4°C, 30 
5 min) were used for dot blot analysis. 

Example 14F 

Pot blot analysis of proteins purified from the roots of Medicago sativa and 
10 Nicotiana tabacum using specific antibodies against the NORK protein 

Ten jjg of protein samples purified from alfalfa and tobacco plants (the 
filtered and centrifuged supernatant of plant homogenized samples described in 
Example 14E) were dot-blotted onto nitrocellulose filter (Hybond-ECL cat. no: 

15 RPN303D, Amersham Pharmacia Biotech Inc, Little Chalfont, Buckinghamshire, 
England) (Figure. 32.). Dot blot analysis was performed with the 1000x diluted 
rabbit antisera as detailed below. Filters were blocked by blocking solution (2.5% 
milk powder, 1% polyvinylpyrrolidone (PVP), 0.1% Tween-20 in PBS) at room 
temperature for 1 hour. The filters were incubated in the same solution containing 

20 the generated antibodies at room temperature for another 1 hour. The filters were 
washed three times each for ten minutes in 50 ml washing solution (0.1% Tween- 
20 in PBS). The filters were incubated further with the horseradish peroxidase- 
conjugated anti-rabbit IgG (5000x diluted in washing solution) at room temperature 
for 1 hour. After three washing steps the filters were developed by 

25 chemiluminescent reaction. After incubation of the filters In developing solution 
(100 mM Tris pH 8.5, 450 uM coumaric acid, 2.3 mM luminol and 0.03% H 2 0 2 ) for 
1 min, they were visualized by an X-ray film exposure (Sterling Diagnostic Imaging 
Inc., Newark, DE 19714 USA) (Figure 32.). The positive signal detected in the 
case of the Nod + and Nod" alfalfa as well as of the transformed tobacco plant 

30 containing the NORK gene demonstrated the reaction of the NORK protein with 
the produced specific antibody. 
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CLAIMS 

1. Nucleotides sequence coding for the polypeptide having NOD region 
specific receptor kinase (NORK) gene activity. 
5 2. Nucleotide sequence coding for the amino acid sequence of Figure 17. 

3. Nucleotide sequence coding for the amino acid sequence of Figure 12. 

4. Nucleotide sequence coding for the amino acid sequence of Figure 30. 

5. Nucleotide sequence coding for the amino acid sequence of Figure 31 . 

6. The nucleotide sequence according to any of claims 1 to 5 which is a 
10 genomic or synthetic DNA molecule: 

CCATATTTTAACAATATTCTTTCTTCTACAAGGGTATAACTTTTATACAAGTTCACTATA 
TTATAGGATTGATCAAGGTTCATTTTTTCTTTCTTTGAAAAATCTCTAAGGGGTGTGGTT 
TCCAAGGCAGAAAATGAAATAGAATGCAGAAGAATTTGTATGGTACTATAAAGGGAAGAT 

15 GAAAAGTTAGTTAGCATGGATTCAAGTTTGATAACCCTTTGGGGTAAAATCTCTTTCAGA 
TTATGATGGAGCTACAAGTTATTAGGATATTTAGATTGGTTGTGGCATGTGTTCTTTGTT 
TGTGTATATTTATCAGATCAGCTTCTTCTGCAACTAAAGGGTTTGAGAGCATATCATGTT 
GTGCTGATTCCAATTACACAGATCCAAAAACAACCCTAACTTATACAACAGATCACATCT 
GGTTCTCTGATAAAAGAAGTTGCAGACCAATACCCGAAATTTTGTTTAGCCACAGAAGCA 

20 ATAAAAATGTTCGAATATTTGAAATAGATGAAGGAAAGAGATGTTATACTTTGCCAACAA 
TTAAGGAT CAAGTATATTTGATAAGGGGTGTATTT CCCTTTGATAGTTTAAATTCTTCGT 
TTTATGTTTATATCGGGGTAACAGAACTAGGTGAATTAAGATCGTCTAGACTCGAGGACT 
TGGAAATCGAGGGAGTTTTTAGAGCCACCAAAGACTATATTGATTTCTGCTTATTGAAGG 
AAGATGTCAATCCCTTCATTTCTCAGATTGAATTGAGGCCATXACCTGAAGAATACCTAC 

25 ATGGTTTCGCTACTAGTGTTTTAAAACTGATAAGCAGAAATAATCTTGGTGACACAAATG 
ATGATATAAGGTTCCCAGATGACCAAAATGATAGAATCTGGAAACGGAAAGCAACTTCAA 
CTCCATCATCTGCCCTTCCCCTGTCTTCCAATGTCAGCAATGTTGACCTCAAAGACAGTG 
TCACACCTCCTCTACAAGTCCTACAAACAGCTCTTACTCACCCTGAGCGATTGGAGTTCG 
TCCATGAT GGCCT CGAGACCGATGATTATGAATACTCTGTGTTTCTCCACTTTCTTGAAC 

3 0 TAAATGGCACTGTCAGAGCAGGACAAAGGGTGTTTGACATCTATCTAAACAATGAGATTA 
AAAAGGAGAAGTTTGATGTTTTGGCTGGAGGGTCCAAGAACAGTTACACTGCCTTGAACA 
TTTCAGCAAATGGATCACTCAATATAACCTTAGTCAAGGCATCTGGATCTGAGTTTGGAC 
CCCTTTTGAATGCCTATGAAATCCTGCAGGCACGGTCGTGGATTGAAGAGACCAACCAAA 
AAGATTTGGAACTTATTCAGAAGATGAGAGAAGAACTGCTGCTGCACAACCGAGAAAATG 

35 AAGCATTGGAGAGTTGGAGTGGAGACCCTTGTATGATTTTCCCCTGGAAAGGAATAACAT 
GTGATGATTCAACTGGTTCATCTATTATCACTATGCTGGATCTTTCTTCCAATAATCTCA 
AGGGAGCAATTCCTTACTTTGTCACTAAGATGACCAATTTACAAATACTGAACCTGAGCC 
ACAACCAGTXCGATTCGTTATTCCCCTCGTTTCCACCGTCCTCCTTGCTGATATCATTGG 
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ATCTGAGCTACAATGATCTTGATGGACGGCTTCCAGAATCCATTATCTCACTGCCACATT 
TAAAATCATTATATTTTGGCTGCAATCCATATATGAAGGACGAAGATACAACAAAGTTGA 
ACAGTTCACTAATCAATACAGATTATGGGAGATGCAAAGGAAAAAAACCAAAGTTTGGAC 
AAGTATTCGTGATTGGAGCTATTACAAGGGGATCACT07TTGATTACTTXGGCTGTTGGAA 
5 TTCTATTTTTTTGCCGTTATAGACACAAGTCAATTACTTTGGAAGGATTTGGTGGAAAGA 
CCTACCCAATGGCAACAAATATAATCTTCTCTTT GCCAAGCAAAGACGATTT CTTCATAA 
AGTCTGTATCAGTTAAACCATTCACTTTGGAGTATATAGAGCAGGCTACAGAACAGTACA 
AAACTTTGATAGGTGAAGGAGGATTTGGTTCTGTTTACAGAGGCACTCTAGACGATGGTC 
AAGAAGTGGCAGTGAAAGTGCGGTCATCCACATCAACTCAGGGAACCCNAGAATTTGATA 

10 ATGAGCTAAACCTACTTTCAGCTATACAACATGAGAACCTGGTGCCTCTTCTGGGTTACT 
GTAATGAGTATGATCAACAAATTCTCGTGTATCCATTCATGTCCAATGGCTCTTTGCTAG 
ATAGACTATACGGGGAAGCATCAAAGAGAAAAATATTAGACTGGCCAACTAGACTCTCTA 
TTGCTqTCGGTGCAGCTCGAGGTTTGGCATATCTTCACACATTTCCAGGACGTTCTGTAA 
TACAC AGGGACGTAAAATCGAGCAATATACTGCTGGATCAGAG CATGTGTGC TAAGGTTG 

15 CAGATTTTGGTTTCTCAAAATACGCTCCTeAGGAAGGAGACAGTTATGTTTCCCTTGAAG 
TAAGAGGAACTGCAGGGTATCTGGATCCTGAGTACTACAAAACCCAGCAATTATCTGAAA 
AAAGTGATGTTTTCAGCTTTGGTGTGGTTCTACTTGAAATTGTAAGTGGACGGGAACCTC 
TCAAC ATAAAGAGACCACGGATCGAGTGGAGCTTGGTTGAATGGGCTAAACCATAC ATAA 
GAGCATCAAAGGTGGATGAAATTGTAGATCCTGGCATCAAGGGAGGATATCATGCAGAAG 

20 CATTGTGGAGAGTTGTGGAAGTAGCACTGCAATGTCTAGAACCCTACTCAACATATAGGC 
CATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTCATTATTGAl^AACAATGCAT 
CTGAATACATGAAATCCATAGACAGCCTTGGAGGATCCAACCGCTACTCAATTGTTATGG 
ACAAACGGGCGCTGCCTTCAACTACATCTACAGCAGAATCAACTATCACAACCCAAACCT 
TGACACACCCTCAACCGAGATAGTAAATGGGTCGATGGAATTCTTTTGATTTGTTTTTTA 

25 TCATTGCTTTAGTAATATCCCATTTTAAATGGTAAAGGAGAAAAATACTACTTTTGATTG 
TATTTTCATCCACTCTATGTTTCTTGAAACTGAATCTCTCTTGCTCAGCCCCAGTTTTTA 
TGGGTGAAAAGAATAATTTGGGTCAAATGCAAGTGAAACCATATGGTGCATAATTTGAAA 
GCCATATTATATCATTTGCTAAGTCCAAAGTAAAAATTTCACAAACTAGTTAGATTGCGA 
TTTAGTCTATACACACTTCAACAGAGCTATATACACTAT 

30 

or a deletion product, recombinant form and variant thereof. 

7. The recombinant form of any of the nucleotide sequences of claims 1 to 5 
which comprise a functional promoter sequence. 

8. Oligonucleotide which comprises at least the functional part of the 
35 nucleotide sequence of any of claims 1 to 7. 

9. Oligonucleotide hybridizing to any of the nucleotide sequences of claims 

1 to 8. 
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10. Antibody capable of detecting the polypeptide having NORK biological 
activity. 

11. Antibody capable of detecting the polypeptide encoded by the DNA 
sequences according to any of claims 2 to 9. 

5 12. Transgenic plant which comprises the nucleotide sequence of any of 

claims 1 to 9. 

13. The transgenic plant according to claim 12 which is a crop plant. 

14. The transgenic plant according to claim 12 which is tobacco plant. 

15. Transformant cell which contains the nucleotide sequence according to 
10 any of claims 1 to 9. 

16. The amino acid sequence according to Figure 17 and a functional 
fragment thereof having NORK activity. 

17. The amino acid sequence according to Figure 12 and a functional 
fragment thereof having NORK activity. 

15 18. The amino acid sequence according to Figure 30 and a functional 

fragment thereof having NORK activity. 

19. The amino acid sequence according to Figure 31 and a functional 
fragment thereof having NORK activity. 

20. Process for preparing plants of Nod + phenotype from plants of Nod" 
20 phenotype characterized by transforming a plant or the reproducing form thereof 

with a DNA according to anv opf claims 1 to 9. 
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Figure 1. (1/2) 
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Figure 1. (cont 2/2). 
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51,5 


le- 05 


MADS-2; 


Coding for MADS-box protein 


emb/CAA71739.1 


51,5 




RT; 


Coding for reverse transcriptase 


gb|AAD21515.1 


34,8 


6.2 


URNS: 


Coding forisoleucinetRNA 


dbj/D82067,l 


129 


le- 28 


LEC; 


Coding for lectin precursor 


gb/AAB51442,l 


127 


le- 35 


RPE: 


Repetitive DNA 


gb/AAB50037.1 


122 


2e- 2 « 


RTR: 


Retrotransposon 


dbj|BAA96774.2| 


201 


3e- 50 


LEC: 


Coding for lectin precursor 


gb/AAB51442,l 


84,3 


le- 38 


LEC: 


Coding for lectin precursor 


gb/AAG00508,l 


96,7 


3o* 


LEC: 


Coding for lectin precursor 


gb/AAB51442.1 


132 


5e- 3 ° 


LEC: 


Coding for lectin precursor 


gb/AAB51442,l 


137 


le- 31 


LEC: 


Coding for lectin precursor 


gb/AAB51442.1 


138 


8e- 32 


127/30: 


LG127/30 like gene 


emb/CAB37543.1 


82,3 


7e- 22 
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Figure 8. (1/3) 



GCTAGCTTAATTTGTGAAGTAGTTTTCAGTCAGCTCATGCTGAACAGTTATGGTAGTTTT 
ACAAGGATGTAAGGGGTTACTTGCTCTTTTGTGGTATCATTGAGCCACTTTCCACTTTCT 
TAATTCTTTGCTATCAAATTTAAATATATTATATTATTGTAGGAAATATGAAAAAGAAAC 
C C TAG AT G C T T T AAT T G T G C T AAG C AT AT AAG AT AC AAAAAAAAAAAAG T C T AAT C T C T T 
T T G AT AGG AAAC T G AC AC AAGG T T C GAT C C T C AGAAC C AA C TAT GAG AC T AAT T T T T G C T 
T G AAT T G G T T AAT C AAC T GC AT C G C AC T CAT T G AAT AAAAAAG T AC G T T TAG AT T G AC G G 
T G AAT T T AAC AT AAT C AC AG T T G ACC AC T T T GAT T T T C AC G AAAG T T GC AT G T AAC T T T C 
GTGCTTGAATATAAAGTCACGGTGCCACAATGGTTTTGTCAATATCGCCGTGAATCCAAA 
CATGCACTTAATATGTTTTTATATTAATGGAAACCTATGTATATAAATATAAAATACTTA 
AG CAT AT AT AT CC T T GC AAC AAT AC C AAAAAAGT T T AAAAT T T G C T C C AAAG AT AC TAT T 
C AC GG T T T C TAT T G T GAT G CAT C CAT G G AAT G AGAAGG AA G AAAAC AG AAG AAG AC G AAG 
G AC TGGTCTGTC C AAT TAT T T AAAC CAT AT G AAT AT T T T TAT AT AC AG AT AAT G AAG C AA 
CC AC C AC T AAAAC AAT AAG AAT AAAT AAT AAT G T T C AAC AT CAT C AAT AAT AAG AAT AAT 
AATGTTTAACATCAACAATAAAACTATAGGAAAAAAACATAATCAACTATGCATTGTACT 
AATTCAATCTCTAACTCGTCTTCCATCTCTTTCCTAGCTACCTCCTGCAGTTTCCTTTCC 
AG GCC T AAAG T C AAAC AC CAT AT T T T AAC AAT AT TCTTTCTTC T AC AG C T T T T AT ACAAG 
TTCACTATATTATAGGATTGATCAGGGTTCATTTTTTCTTTCTTTGAAAAATCTCTAAGG 
GGTGCTGTTTC C AAG GCAG AGAAT G AAAT AG AAT T C AG AAG AAT TTTTATGT T AC T AT AA 
AGGAAAGATGAAAAGTTAGTTAGCATGGATTCAAGTTTGATAACCCTATGGGGTAAAATC 
T C T T T C AGAT TAT GAT GGAGT T ACAAGT T AT T AGGAT AT T T AGAT T GGT T GT GGC AT T T G 
TTCTTTGTTTGTGTATATTTATCAGATCAGCTTCTTCTGCAACTAAAGGTTAGTAGCTAA 
ATACTATAATTCTTTAAGATCATAATAAIATCTATTACTTGATTTCTTTCCCTTTAAACA 
TAG AAAAC AAC AT AT T T T AAT T AAC AT G AAAG G C CAT G G G AT GAT CAT AAT T AAT AAT G A 
GAAGAAAAT AAAT AAGAAAC TTGTTTTTTT T ACAAGG AT T G T AAAAT AG AAC TAG TAG T T 
TCATACTTTCAATACTGAGAATCTTGAAACAATTTCACTTTTTCTTTATGTTGCTAGAAT 
TTCTTTCAAGGGAAAATCCAATTTTGTACAAAATGAATTTAACTTGTGACATTTTCCTTG 
TAG G G T T T GAG AG CAT AG CAT GTTGTGCTGATT CAAAT T AC AC AG AT C C AAAAAC C AC C C 
T AAC T TAT AC AAC AG AT C AC AT CTGGTTCTCT G AT AAAAG AAG T T G C AG AC AAAT AC C C G 
AAAT T T T G T T TAG C C ACAG AAG C AAT AAAAAT G T T C G AAAAT T T G AAAT AT AT G AAG G AA 
AG AG AT G T TAT AAT T T GC C AAC AG T T AAG G A T C AAG TATA T T T GAT AAG G G G CAT AT T T C 
CCTTTGATAGTTTAAATTCTTCGTTTTATGTTTCGATCGGGGTAACAGAACTAGGCGAAT 
T AAG AT C G T C TAG G C T C GAG G AC T T G G AAAT T GAG G GAG T T T T T AG AG C C AC C AAAG AC T 
AC AT AG AT T T C T G C T TAT T G AAGG AG GAT G T C AAT C C C T T CAT T T C T C AG AT T G AAT T G A 
GGCCATTACCTGAAGAATACCTACATGGTTTCGGTACTAGTGTTTTAAAACTGATAAGCA 
G AAAC AAT C T T G G T G AC AC AAAT GAT GAT AT AAG G TAT G T GAT C T TAG T T TAT T T T TAG G 
TAGATTCCACCTCTATTTTACAGGGAGTGTCTCTCAGGAAACCTAAAAGGCTTAGGGTTT 
G T C AC T AC T T GGT TCT T AT G GAAGCAT CAT AT G T T CAT AC T TAG TAG AT AT AT AT T G T AT 
AAAAAT C AAC AT CTTTCTCCG T AAG GG AT AG AAAT T G C AAT T CAT G T TAT G T T GAG CAT A 
T T T AAAC C AAAAT AAT T T GAG G AAT AT GAT G CAT GAAC CTTTCTGC CAAAT G C AT G AC AT 
AAC C TAT G T T T C AC T T T AC AT A TAG AAT TAT AAG AT G T G T T T AC AT C T T TAT AT T AAC T T 
T GAT C T T TAT GAT GC AC T GAG AT AC AC C T G AAT T T AAAT AAC AAAT G GAG AAG CAAG AAT 
C TAG AAC T C T AC CAT T C AAT C CAT T G T AAT A T GG AAC T C AC AAG AAT T AAC TAT T G C C AC 
T T C AG AT T AAAT AC CC T AAG AG TGTTTTGTG GAT GAGGG AAT G T T T T GAG G GAAT G T AAT 
GTTTTGAGGGAATTCCTCAAAACATTACATTACCTCAAAATATTCCCCCATCCAAACACA 
CTATAAAGGATCCTAATCTAGTGTAGCCTTAGTTTTCTATTGAAGTGATTTGTGTTTAGA 
GTTTGGTCTGCATCTGCAACTTTGTTTTGTTGCTGTAATGCCATGACTATGAACACTGAC 
AG AAAGG T C T CAT AAT AT C G GT AT C T AT C AAT T T TAG G T T C C C AG AT G AC C AAAAT GAT A 
GAATCTGGAAACGGAAAGAAACTTCAACTCCAACATCTGCCCTTCCACTGTCTTTCAATG 
T C AG C AAT G T T GACCT CAAAGAC AG T G T CACACCT C C T C T AC AAG T C C T ACAAACAG C T C 
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Figure 8. (cont. 2/3) 



TTACTCACCCTGAGCGATTGGAGTTCGTCCATGATGGCCTCGAGACCGATGATTATGAAT 
ACTCTGTGTTTCTCCACTTTCTTGAACTAAATGGCACTGTCAGAGCAGGACAAAGGGTGT 
T T G AC AT C TAT C T AAAC AAT GAG AT T AAAAAG GAG AAA T T T GAT GTTTTGGCT GG AG GG T 
C C AAG AAC AG T T AC AC T GC C T T G AAC AT T T C AG C AAAT G GAT C AC T C AAT AT AAC C T TAG 
TCAAGGCATCTGGATCTGAGTTTGGACCCCTTTTGAATGCCTATGAAATCCTGCAGGCAC 
GG T C G T GG AT T G AAG AG AC C AACC AAAAAG AT T G T AAG T G T AC AAT AAG AT G C T AAAT T G 
ANGAATTTTAT TAACCTTTAACCAATTTTCTATTTCATTCTCCCTCTTACACTAACACTT 
TTTTTCCTTT C AG T G G AAGT TAT T C AG AAG AT GAG AG AAG AAC T G C T G C T GC AC AAC C AA 
GAAAATGAAGCATTGGAGAGTTGGAGTGGAGACCCTTGTATGATTTTCCCCTGGAAAGGA 
AT AAC AT G T GAT GAT T C AAC T G G T T CAT C TAT TAT C AC T AAG C T G T AAG T C C T T C C AC T T 
TTTAGGGTCTGTTTAGATTTGCTTATTTGAGTTTATCTATTGAAATAAACACTTATGACA 
C T G T T T G G AAG AG C T TAT G AAAAC AAC T T AT AG T T TAT AC G AAAAC AAG T T G AC T T T G T T 
AT AT C TAT T T T AT AGAAAT AG C T TAT AAG T AAGAAC T TAT AT GAT AAGCGT T TAT G C TAT 
AAAC G C T C AAT T T AAC T G T T TAT C C AAACAGGAC T T T AAT G CAT C CAT AT G T G T T T ACAA 
GTTTTGCTCTATTTTCTGCAGGGATCTTTCTTCCAATAATCTCAAGGGAGCAATTCCTTC 
CATTGTCACTAAGATGACCAATTTACAAATACTGTACTGCTTCTAAATCTCATTTAAAAA 
CATCCACATTATTTTTGTGGGAACAGTGGACTGTTTTTATCCATTAGTTAATAATGTTGT 
AAC AT T T T T G T TAT G C GC AGG AAC T GAG C C AC AAC C AG T T C GAT AT G T TAT TCCCCTCGT 
TTCCACCGTCCTCCTTGCTGATATCATTGTAATTATCTCTTCTCATGTTTAACAAATGAA 
CTAGGATGATACTAATATGAGCTTATAGCACTTCTATTATCCTTGTAAATGTTAACATAA 
ACAAGCACATGCAGATTATAGAACTAAAATATGATAGAATATGCTGTATATAGTCCCAAC 
T CAT G T T G G T T AC AT AAGG T AG AT AT TAG AC AG T C T C AAT G C T G AAAC C AAT AT C T G T G T 
GCATGATGCATCTTAATCTTTAAAAGATTTTATGATTAAGGCTGTTCTCTATGGCACTTC 
AG TAG GAG C AG T AAC AC AT C TAT G T G AAAG T T CC AT AT G AAAT C C T T GAG AAAT AT G T T T 
TGACATTATGTTTCATATATTGCTGAATTTCTTTCTTCACCAAGGTTCGAATCTTAGAGA 
AATTTCCTACAGTTATTAGTGCCATTGAAAAATTACTACTAAATCTTTTTCAAATGATTG 
ATCTACATTTATGGATCAGGGATCTTAGCTACAATGATCTTTCAGGATGGCTTCCAGAAT 
CCATTATCTCACTGCCACATTTAAAATCATTGTAAGTTTTATATGTTGGCATTCTACTTC 
AT CC AT AT TAG GAAG C TAT T T T CGAT T G T T G TAT AT T T T T AAT AT AC T T CAT T TAT T CAG 
C T T G TAT T T GAT T TAT T T C C AG AT A T TTTGGCTG C AAT C CAT C T AT GAG T G AC G AAG AT A 
C AAC AAAG T T G AAC AG T T C AC T AAT C AAT AC AG AG TAT GAAG TAT T A T T GAT G CAT AAC A 
AT AG AAG T T T T T AAAAT AAAAT AAAAT AAAT C AAAAT T T T AC TCGTTTCTGTTTT C AC AG 
T T AT GG G AG AT G C AAAGC AAAAAAAC C AAAG T T T GG AC AAG TAT T C G T GAT T GG AG C T AT 
TACAAGTGGATCACTTTTGATTACTTTGGCTGTTGGAATTCTATTTTTTTGCCGTTATAG 
ACACAAGTCAATTACTTTGGAAGGATTTGGAAAGACCTACCCAATGGCAACAAGTANGCA 
T T T TAT C T T T TAT AAT C TAT T G T AAC AT AT G T C TAT C T GC AT T C T AAT G T AAC AT AT T T G 
C T C T T TAG AC T AAC AAAC AAAAAC TAT AG T C AC AT CAT T C T GC AT AG AC AAT C T CAG AT C 
AC AAG AT C T AAG TGTTATCTT T AAC AT AG AGC AT GG T T T AC AAAT AC AG G G AAT AC AT T T 
T TAT AAC C C T GAG AT T AAAT C AAC T T T T C T TAT AC AG T CAG AC AG GAT C T G C AC AT AAT G 
T T CAACACAAT GAT AAAAT AAGAGAAAT AT AT T T AAAAAAAC TAACAT CCAGTGT G AAT T 
C C TAG AT AT AT G G C T AC AAAAAAAT C TAT AT T C CAT C T G T AC AAT AAC AAG AAT AT C AAT 
T G AAAAAAGAAT AC T TAT GAAC AT GAT AGAT GAT AT TAG T T T C T G TAG AG GAT CAT T AC T 
ATGATGGAAGTGATGATTTTAGCATGTTTGGTTTGGTCTAATCTACATTGTATATATGAC 
AT T C T C C AAC AAAAAC AT G C AGAT AT AAT TTTCTCTTTGC C AAG C AAAGAC GAT T T C T T C 
AT AAAG T C T G T AT CAG T T AAAC C G T T C AC T T T GG AG TAT AT AG AG CAG G C T AC AG AAC AG 
TACAAAACTTTAATAGGTGAAGGAGGATTTGGCTCTGTTTACAGGGGCACTCTAGACGAT 
GGTCAAGAAGTGGCAGTGAAAGTGCGGTCATCCACATCAACTCCGGGAACCCGAGAATTT 
GAT AAT G AGG TAT AAT AT GC AT TAT CAC T T TAT TAG AG C AACAAGAT T C C C AC AG AATG T 
TCCTAATTAAAGCTTTGGCCATATATATAGTAGTTTAAGAATTTGTTCGGTTCCTATCAC 
AC CAT C T GAT T GG T C TAT GAT G GAT CAT G C AGC T AAAC C T AC T T T C AGC TAT AC AAC AT G 
AGAACCTGGTGCCTCTTCTGGGTTACTGTAATGAGTATGATCAACAAATTCTCGTGTATC 



SUBSTITUTE SHEET (RULE 26) 



WO 02/102841 



PCT7HU02/00055 



11/60 

Figure 8 . ( con t . 3 / 3 ) 

C T T T CAT G T C C AAT GGCTCTTT GC TAG AT AG AC TAT AC G G T AAAT AT C T C AC AAT T C T T T 
T AGC AG AT AT G T T T T AAT AC AC AG T AAT T C T G AAAT T T AAT T T CAT AAC AG GGG AAG CAT 
CAAAGAGAAAAATATTAGACTGGCCAACTAGACTCTCTATTGCTCTCGGTGCAGCTCGAG 
GCAAGTACCATGATGTTGTTTTACTTTTACTAAGTTGTGCATATGTACTAATTAAGCTTG 
CAT CAT AT C AG AG AG G AT AAAAC AT GAAAT T G TAT G T AAAC AT T T CAT AT GAAAAC AT AA 
AAC T AAT T C AAAT G AAC AAT G AAAAAAG AG AAG C G TAT AC AAT GAT TAT G T C T T AC C AC A 
GGCG CAGGGAC T T GAATATAAGATACATAT T GAT T T CT C T AC T GAACAT ATAAACTGGAG 
AC AAT C AC AAG C AAAAT GAT GAG G C TAT AT T GAT T G AC AAG G AC AAT CAT AT T G T T T T AC 
T G T GAT TAG AAC GAT AC AG T AAAG AT A T T T TAG AG G G AAT AC T G T T T AC T AC TAT AAG AG 
TGACTGATAGTTCGGATGTTGCTAATGGAAATTGATAAGATGGTCAACATTGACTTAGTG 
AAATCTGTCTGACAACTAAGTATTTTGGCCCTCACGGATCTACTTTGGCATAAAGAAAAA 
CTTCTTGAAATGATGCATCCTAAAGTTCTTTATTCTTCACATAATAAACTCAGTTTTTGA 
CCTAACTGACTTCACTACAGTTTTCGTCCATAACATATTTAGGACACTAAAATGATCATA 
AG T T T T C T GAT T G AGC AG G T T T GGC AT AT C T T C AC AC AT T T C C AGG AC G T T C T G T AAT AC 
AC AG G G AC G T AAAAT C GAG C AAT AT AC T GC T G GAT C AG AG CAT G T G T G C T AAG G T T G C AG 
ATTTTGGTTTCTCAAAATATGCTCCTCAGGAAGGAGACAGTTATGTTTCCCTTGAAGTAA 
GAGGAACTGCAGGGTATCTGGATCCTGAGTAAGTGAACTTAAATCCGTCTTAATCTGAAT 
GT C T CAT AC TGCTCTCT CAT C TAT C AC T T T C AG C AAAAT AT T C AAT AC AT T T C C C AC AC A 
C AAG T T T G T T G G AACC AT C AG AAAAC T G AAG AAAT G T T T C T GAT TAT AT TCCTATTCT G A 
AT C T G AAG T G TAT AT T AA T T T T AAAC AT AC AAAAC T AAAG T T C C CAT AT G G T G AC T AC AA 
AT GAT AAG A GAT AT T TAT CC AT G T G AA T T T G AAT G G GAT G AAG AAG AAT C T AAC AT AC C T 
AG T C C CAT T T G T T T C AC A T G T T AC AG AG AG T AAC CAT G AC AT G AAC AAAT T T C AC AAT AG 
C T G A C G T T TAT C CAT AT G C AG G TAG TAG AAAAC C C AG C AAT TAT C T G AAAAAAG T GAT G T 
TTTCAGCTTTGGTGTGGTTCTTCTTGAAATTGTAAGCGGACGGGAACCTCTCAACATAAA 
GAGACCAAGGATCGAGTGGAGCTTGGTTGAATGGGTATGATCCATGACCCCATTTTTTTT 
AC AAAT AT AAT T T T CAAG T GGT T CAT AT AT T T T AGT AGGT ACT AACAC AT AACT T T CAT G 
C AG G C T AAAC CAT AC AT AAGAG CAT C AAAG G T GG AT GAAAT T G T AGAT C C T G GCAT C AAG 
GGAGGATATCATGCAGAGGCATTGTGGAGAGTTGTGGAAGTAGCACTACAATGTCTAGAA 
CCCTACTCAACATATCGGCCATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTC 
ATTATTGAAAACAATGCATCTGAATACATGAAATCCATAGACAGCCTTGGAGGATCCAAC 
C G C T AC T CAAT T G T TAT G G AC AAAC GG GC G C T GC C T T C AAC T AC AT C TAG AG C AG AAT C A 
ACTATCACAACCCAAACCTTGTCACACCCTCAACCGAGATAGTAAATGGGTCGATGGAAT 
TCTTTTGATTTGTTTTTGATCATTGCTTTAGTAATATCACATTTTAAATGGTAAAGGAGA 
AAAAT ACTACTTCTGATTGTATTTCCATCCACTCTATGTTTCTTGAAACTGAATCTCTCT 
TGCTCAGCCCCAGTTTTTATGGGTGAAAAGAATAATTTGGGTCAAATGCAAGTGAAACCA 
TAT G G T G CAT AAT T T AAAAGCC AT AT C ATAT CAT T T G C CAAG T C C AAAG T AAAAAT T T C A 
C AAAC TAG T TAG AT T G C GAT T T AG T C TAT AG AC AC T T C AAC AG AG C TAT AT AC AC TAT G G 
TTGACTTGCGACTAATTCGCTCAAGCAGGAGGAACACATATATATGGGAAACACTTGTAG 
AACTATTTTGTTTATAGAAATGAAAATATTTTCCGTTTTTAATTTTTGTTTTCATTAAGT 
TTGAATTATTTTTACTTTGGTTCATACATGATTCTAAATTATGTTTGAAGTTATGGTAAG 
AAAGTTGTCATTGCGATWVAATACGGCCGCAATTGCCGCTGTGCATCTTCAGCTCAAGTG 
ATTGTCTCTTGGTGATTGTTCAACAGACCCTCAACATTGCTCATCCTATCAGAACCCAAC 
TTCCAATAGGCATTTGTTCTACTTAAAAGAAAATCGGAAGCTAGC 
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Forward primers: 


1 Plinth 


Start 


Pynn/lntrnn 
piuii/iiiuuii 


Pr RKU5: 


5'-AGGGGTTACTTGCTCTTTTGTQGT-3' 


OA 

<£4 


poini \V\/) 

70 

it. 


position 

prom, 


Pr.RKUB 


S'-ATCTCTMCTCGTCTTCCATCTW 


l& 


84/ 


prow, 


Pr RKU4 


5'-TTCCAGGCCTAAAGTCAAACACCA-3' 




QA7 




Pr RK-ATG 


5'-TGTCTTTGGTACCATGATGGAGTTACAAGTTA-3' 


32 


1139 


El 


Pr_RKEx2U 


5'-ATGGATCCGGGTTTGAGAGCATAGCATG"3' 




1 900 




Pr RKU3 


5''TGAGGGAGTTTTTAGAGCCACCAAAG-3' 


26 


1892 


E2 


Pr_RKU2 


5'-CACCCTGAGCGATTGGAGTTC-3' 


I 1 


£540 


CO 
L y 


PrJst-AS 


KAMTCCTGCAGGCACGGTCGTGG-3' 


% 




E3 


Pr RKU7 


5'-MGCATTGGAGAGTTGGAGTGG-3' 


00 


W7AW 


E4 


Pr RKU2X 


5-AAACAAGCACATGCAGATTAT-3' 


0\ 


*y\QQ 


16 

1 w 


Pr„RK-ECO-5' 


S , -GGCTGTTGGMTTCTATTrrTTTGC.3 l 


25 


4947 


E9 


Pr.RKUl 


5'-TACAGGGGCACTCTAGACGATGGT"3' 


24 


5679 


E1Q 


Prjsi-B5 


5'"AAGAGGAACTGCAGGGTATCTGG-3' 


23 


6956 


E13 


Pr.RKUO 


5'-GCAGAGGCATTGTGGAGAGT'3' 


20 


7631 


E15 


rvsvciwc i^i it i iv 

Pr RKD5 


arc* 

WGGGTGGTTTTTGGATCTGTGTAA-3' 


24 


1621 


E2 


Pr_RKD4 


5-TTAAAACACTAGTACX1GAAACC-3 1 


22 


2028 


E2 


Pr.RK.D3 


5'-ACCCTCCAGCGAAAACATCAAA"3' 


22 


3120 


E3 


Pr_PsW3 


5'"ACGACCGTGCCTGCAGGATTTCA-3' 


23 


3246 


E3 


PrRKD2x 


5'-TAAATGTGGCAGTGAGATAATGGA-3' 


24 


4582 


E7 


Pr_RK-ECO-3' 


S'-GCAAAAAMTAGAATTCCAACAGCW 


25 


4972 


E9 


Pr_RKD2 


5'"CCCTGAGTTGATGTGGATGA03' 


21 


5746 


E10 


Pr_RKD7 


5'-GATACCCTGCAGTTCCTCTTA<3' 


21 


6976 


E13 


Pr RKD1 


5M3GCCGATATGTTGAGTAGGGTTCT-3' 


24 


7698 


E15 


Pr_RK-STOP 


5-AAAAGAGCTCCATCGACCCATTTACTATC-3' 


29 


7924 


E15 


Pr.RKDO 


S'-GMTTAGTCGCAAGTCMCW 


20 


8236 


E15 




-h — mi — Hf 



o 

H 

o 



o 



exon 

► ► ►►►►►► jj 

« « < < < < 4< 4 | 

o 
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Figure 10. 

TTCCAGGCCTAAAGTCAftACACCATATTTTAACAATATTCTTTCTTCTACAGCTTTTATA 
CAAGTTCACTATATTATAGGATTGATCAGGGTTCATTTTTTCTTTCTTTGAAAAATCTCT 
AAGGG G TGCTGTTTC CAAGG C AG AG AAT G AAAT AG AAT T C AG AAGAAT T T T TAT G T T AC T 
AT AAAG G AAAG AT G AAAAG T TAG T TAG CAT G GAT T C AAG T T T G AT AAC C C TAT G G G G T AA 
AATCTCTTTCAGATTATGATGGAGTTACAAGTTATTAGGATATTTAGATTGGTTGTGGCA 
TTTGTTCTTTGTTTGTGTATATTTATCAGATCAGCTTCTTCTGCAACTAAAGGGTTTGAG 
AGCATAGCATGTTGTGCTGATTCAAATTACACAGATCCAAAAACCACCCTAACTTATACA 
AC AG AT C AC AT CTGGTTCTCT GAT AAAAG AAG T T G C AG AC AAAT AC C C G AAAT T T T G T T T 
AGCCAC AGAAGCAAT AAAAAT GT TCGAAAAT T T G AAAT AT AT G AAG G AAAG AG AT G T T AT 
AATTTGCCAACAGTTAAGGATCAAGTATATTTGATAAGGGGCATATTTCCCTTTGATAGT 
TTAAATTCTTCGTTTTATGTTTCGATCGGGGTAACAGAACTAGGCGAATTAAGATCGTCT 
AG G C T C GAG G AC T T GG AAAT T GAG G GAG T T T T TAG AG C C AC CAAAGAC T AC AT AG A T T T C 
T G C T TAT T GAAGG AG GAT G T C AAT C C C T T CAT T T C T CAG AT T G AAT T GAG G C CAT T AC C T 
G AAG AAT AC C T AC AT GGTTTCGGTACTAGTGTTT T AAAAC T GAT AAG C AG AAAC AA T C T T 
GG T GACAC AAAT GAT GAT AT AAGGT T CC C AGAT GAC CAAAATGAT AGAAT CT G GAAAC G G 
AAAGAAACTTCAACTCCAACATCTGCCCTTCCACTGTCTTTCAATGTCAGCAATGTTGAC 
CTCAAAGACAGTGTCACACCTCCTCTACAAGTCCTACAAACAGCTCTTACTCaCCCTGAG 
CGATTGGAGTTCGTCCATGATGGCCTCGAGACCGATGATTATGAATACTCTGTGTTTCTC 
C AC T T T C T T GAAC T AAAT G G C AC T G T C AG AGC AG G AC AAAGGG T G T T T GAC AT C TAT C T A 
AACAATGAGATTAAAAAGGAGAAATTTGATGTTTTGGCTGGAGGGTCCAAG7VACAGTTAC 
AC T G CC T T GAAC AT T T C AGC AAAT G GAT C AC T C AAT AT AAC C T TAG T C AAG G CAT C T G G A 
TCTGAGTTTGGACCCCTTTTGAATGCCTATGAAATCCTGCAGGCACGGTCGTGGATTGAA 
GAGACCAACCAAAAAGAT T T GGAAGT T AT T C AGAAGAT GAGAGAAGAAC T GCTGCT GC AC 
AACCAAGAAAATGAAGCATTGGAGAGTTGGAGTGGAGACCCTTGTATGATTTTCCCCTGG 
AAAG G AAT AAC AT G T GAT GAT T C AAC T G G T T CAT C TAT TAT C AC T AAGC T GG AT C T T T C T 
T C C AAT AAT C T CAAGGG AG C AAT T C C T T C CAT T G T C AC T AAG AT GAC C AAT T T AC AAA T A 
CTGAACCTGAGCCACAACCAGTTCGATATGTTATTCCCCTCGTTTCCACCGTCCTCCTTG 
CTGATATCATTGGATCTTAGCTACAATGATCTTTCAGGATGGCTTCCAGAATCCAT TATC 
T C AC T G C C AC AT T T AAAAT C AT TAT AT T T T G G C T GC AAT C CAT C TAT GAG T GAC G AAG AT 
AC AAC AAAG T T GAACA G T T C AC T AAT C AAT AC AG AT T AT G G GAG AT G C AAAG C AAAAAAA 
C C AAAGT T T G GAC AAG TAT T C G T GAT T G GAG C T AT TAG AAG T G GAT C AC T T T T GAT T AC T 
TTGGCTGTTGGAATTCTATTTTTTTGCCGTTATAGACACAAGTCAATTACTTTGGAAGGA 
T T T G G AAAG AC C T AC C CAAT GGCAAC AAAT AT AAT TTTCTCTT T GCCAAG CAAAGAC GAT 
T T C T T CAT AAAG T C T G TAT CAG T T AAAC C G T T C AC T T T G GAG TAT AT AG AG CAG G C T AC A 
GAACAGTACAAAACTTTAATAGGTGAAGGAGGATTTGGCTCTGTTTACAGGGGCACTCTA 
GAC G AT GG T C AAG AAG T G G CAG T G AAAG T G C G G T CAT C C AC AT C AAC T C AG GG AAC C C G A 
G AAT T T GAT AAT G AGC T AAAC C T AC T T T C AGC TAT AC AAC AT G AGAAC CTGGTGCCTCTT 
CTGGGTTACTGTAATGAGTATGATCAACAAATTCTCGTGTATCCTTTCATGTCCAATGGC 
TC T T T GC T AGAT AGAC T AT ACGGGGAAGC AT CAAAGAGAAAAAT AT TAGAC TGGCC AAC T 
AGACTCTCTATTGCTCTCGGTGCAGCTCGAGGTTTGGCATATCTTCACACATTTCCAGGA 
CGTTCTGTAATACACAGGGACGTAAAATCGAGCAATATACTGCTGGATCAGAGCATGTGT 
GCTAAGGTTGCAGATTTTGGTTTCTCAAAATATGCTCCTCAGGAAGGAGACAGTTATGTT 
T C C C T T G AAG T AAG AG GAAC T GC AG GG TAT C T G GAT C C T GAG T AC T AC AAAAC C CAG C AA 
TTATCTGAAAAAAGTGATGTTTTCAGCTTTGGTGTGGTTCTTCTTGAAATTGTAAGCGGA 
CGGGAACCTCTCAACATAAAGAGACCAAGGATCGAGTGGAGCTTGGTTGAATGGGCTAAA 
CCATACATAAGAGCATCAAAGGTGGATGAAATTGTAGATCCTGGCATCAAGGGAGGATAT 
CAT G CAG AG G CAT T G T G GAG AG T T G T GG AAG TAG C AC TAG AAT G T C TAG AAC C C T AC T C A 
ACATATCGGCCATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTCATTATTGAA 
AAC AAT G CAT C T GAAT AC AT G AAAT C CAT AG AC AG C C T T G GAG GAT C C AAC C G C T AC T C A 
ATTGTTATGGACAAACGGGCGCTGCCTTCAACTACATCTACAGCAGAATCAACTATCACA 
AC CCAAACCT T GTCACACCC TC AACCGAGAT AGT AAAT GGGT CGATGGAAT T C T T T T GAT 
TTGTTTTT GAT CAT TGCTTTAG T AAT AT C AC AT T T T AAAT G G T AAAG GAG AAAAAT AC T A 
CTTCTGATTGTATTTCCATCCACTCTATGTTTCTTGAAACTGAATCTCTCTTGCTCAGCC 
CCAGTTTTTATGGGTGAAAAGAATAATTTGGGTCAAATGCAAGTGAAACCATATGGTGCA 
T AAT T T AAAAG C CAT AT CAT AT CAT T T GC C AAG T C C AAAG T AAAAAT T T C ACAAAC T AG T 
TAG AT T G CG AT T TAG T C TAT AG AC AC T T C AAC AG AGC TAT AT AC AC T AT GG T T GAC T T G C 
GA 
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Figure 11. (1/5) 

Nhel 

GCTAGCTTAATTTGTGAAGTAGTTTTCAGTCAGCTCATGCTGAACAGTTATGGTAGTTTT BAC A17 

ACAAGGATGTAAGGGGTTACTTGCTCTTTTGTGGTATCATTGAGCCACTTTCCACTTTCT BAC Al 7 

TAAT TCTTTGCTATCAAATT TAAATATAT TAT AT TAT T GTAGGAAATATGAAAAAGAAAC BAC A17 

CCTAKATGCTTTAATTGTGCTAAGCATATAAGATACAA2^AaAAAAA2?AGTCTAATCTCTT BAC A17 

TTGATAGGAAACTGACACAAGGTTCGATCCTCAGAACC^ACTATGAGACTAATTTTTGCT BAC A17 

TGAATTGGTTAATCAACTGCATCGCACTCATTGAATAAAAAAGTACGTTrAGATTGACGG BAC A17 

T GAAT T TAACATAATCAC AG T TGACCACT TTGAT T T TCACGAAAGTTGCATGTAACT TT C BAC Al 7 

GT GC T TGAATATAAAGTCAC GGTGCCACAATGGT T TTGT CAATATCGCCGKGAATCCAAA BAC A17 

CATGCACTTAATATGT T T TTATAT TAAT GGAAACCTATGTATATAAATATAAAATACTTA BAC A17 

AGCATATATATCCTTGCAACAATACCAAAAAAGTTTAAAATTTGCTCCAAAGATACTATT BAC A17 

CACGGT T T C TAT T GTGAT GC AT CCAT GGAATGAGAAGGAAGAAAACAGAAGAAGAC GAAG BAC A17 

GACT GGTCTGTCCAAT TATT TAAACCATATGAATAT T T T TATATACAGATAATGAAGCAA BAC A17 

CCACCACTAAAACAATAAGAATAAATAATAATGTTCAACATCATCAATAATAAGAATAAT BAC A17 

AATGTTTAACATCAACAATAAAAC TATAGGAAAAAAACATAAT CAAC TATGCAT TGTAC T BAC A17 

AATTCAATCTCTAACTCGTCTTCCATCTCTTTCCTAGCTACCTCCTGCAGTTTCCT BAC A17 

T TCCAGGCCTAAAGTCAAACACCATATT T TAACAATAT TCT TTCTTCTACAGCT T T TATA BAC A17 

T TCCAGGCCTAAAGTCAAACACCATATT T TAACAATATTCT TTCTTCTACAGCT TTTATA cDNS Al 7 

CAAGTTCACTATATTATAGGATTGATCAGGGTTCATTTTTTCTTTCTTTGAAAAATCTCT BAC A17 

CAAGT TCACTATATTATAGGATTGATCAGGG T TCAT T T T T TC TTTCT TTGAAAAATCTCT cDNS Al 7 

AAGGGGTGC TGTTTCCAAGGCAGAGAATGAAATAGAATTCAGAAGAATT TT TATGT TAC T BAC A17 

AAGGGGTGCTGTTTCCAAGGCAGAGAAT GAAATAGAATT CAGAAGAAT T TT TAT GT TACT cDNS A17 

ATAAAGGAAAGATGAAAAGT TAGT TAGCATGGAT T CAAGT T T GATAACCCTATGGGGTAA BAC A17 

ATAAAGGA2VAGAT GAAAAGT TAGT TAGCATGGAT TCAAG T T TGATAACCCTATGGGGTAA cDNS A17 

AATC TCTT T CAGATTAT GAT GGAGT TACAAGT TAT TAGGATATTTAGATTGGT TGT GGCA BAC A17 

AATCTCTTTCAGAT TATGAT GGAGT TACAAGT TAT TAGGATAT T TAGAT TGGT TGT GGCA cDNS Al 7 

T TTGT TCT TT GT TTGT GTATAT TTATCAGATCAGC TTCT TCT GCAACTAAAGGT TAGTAG BAC A17 

T T TGT TCT T TGT TTGT GTATAT TTATCAGATCAGC T TCTTCTGCAACTAAAG cDNS A17 



C T AAAT AC T AT AAT TCTT T AAG AT CAT AAT AAT AT C T AT TAC T T GAT TTCTTTCCCTTTA 
AACATAGAAAACAACATATT T TAAT TAACATGAAAGGCCATGGGAT GAT C ATAAT TAATA 
AT GAGAAGAAAATAAAT AAGAAAC TTGTTTTTTT T ACAAGGAT T GTAAAAT AGAACT AG T 
AGTT TCATACTTTCAATACTGAGAATCT TGAAACAAT T TCACT T TTTCT T TATGT T GC TA 
GAAT T TCTT TCAAGGGAAAAT CCAAT TT TGTACAAAAT GAAT T TAACT TGTGACAT TT TC 
CTTGTAG 



II 



GGTT TGAGAGCATAGCATGT TGTGCTGAT TCAAAT TACACAGATCCAAAAACCACCC TAA BAC A17 

GGTT TGAGAGCATAGCATGT TGTGCTGATTCAAAT TACACAGATCCAAAAACCACCCTAA cDNS A17 

C T TATACAACAGATCACATC T GGT TC TCTGATAAAAGAAGTT GCAGACAAATAC CCGAAA BAC A17 

C TTATACAACAGATCACATC T GGT TCTCTGATAAAAGAAGT TGCAGACAAATAC CCGAAA cDNS A17 

T T T TG T T T AGCCACAGAAGCAATAAAAAT GT T CGAAAAT T T GAAATATAT GAAG GAAAGA BAC A17 

T T T T GT T TAGCCACAGAAGCAATAAAAAT G T T CGAAAAT T T GAAATATAT GAAG GAAAGA cDNS A17 

GATGT TATAATTTGCCAACAGTTAAGGATCAAGTATAT TTGATAAGGGGCATAT TTC CC T BAC A17 

GATGT TATAATT TGCCAACAGT TAAGGAT CAAGTAT AT T T GATAAGGGGCAT AT T T CCCT cDNS A17 

TTGATAGTTTAAATTCTTCGTTTTATGTTTCGATCGGGGTAACAGAACTAGGCGAATTAA BAC A17 

T TGATAGT TTAAATTCT TCGT T TTATGTTTCGATCGGGGTAACAGAACTAGGCGAAT TAA CDNS Al 7 

GATCGTCTAGGCTCGAGGAC TTGGAAATTGAGGGAGT TTT TAGAGCCACCAAAGACTACA BAC A17 

GATCGTCTAGGCTCGAGGAC TTGGAAATTGAGGGAGT TTT TAGAGCCACCAAAGACTACA cDMS A17 

TAGATTTCTGCTTATT GAAGGAGGATGTCAATCCC T TCATT TCTCAGATT GAAT TGAGGC BAC A17 

TAGATTTCTGCTTATTGAAGGAGGATGTCAATCCCTTCATTTCTCAGATTGAATTGAGGC cDNS Al 7 
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Figure 11. (confc. 2/5) 

CATTACC TGAAGAATACC TAGATGGTTTCGGTACTAG TGTTTTAAAAGTGATAAGCAGAA BAG All 
CAT TACC TGAAGAATACC TACATGG T TTCGG TAC TAG TGT T T TAAAAC TGATAAGCAGAA cDNS A17 

ACAATCT TGG TGACACAAATGATGATATAAGGTAT G TGATC TTACT T TAT TTT TAGGT AG BAG All 
ACAATC T TGGTGACACAAATGATGATATAAG cDNS A17 

ATT CCACCTCTAT T T TACAGGGAGTGTCTCT CAGGAAACCTAAAAGGCT TAGGGT T TGT C 
ACTACTTGGT TCTT ATGGAAGCATCATATGT TCATACT TAGTAGATATATAT T GTATAAA 
AAT CAACAT C T T T C T C C GTAAG GGATAGAAAT T GCAA T TCAT GT TATGT T GAGCATAT T T 
AAACCAAAATAATTTGAGGAATATGATGCATGAACCT TTCT GCCAAATGCATGACATAAC 
C T AT GT T TCAC T TT AC ATATAGAAT TATAAGATGTGT T TACAT CT T T ATAT TAAC T T TGA 
TCTT TATGATGCAC TGAGATACACC TGAAT TTAAATAACAAATGGAGAAGCAAGAATCTA 12 
GAACT CTACCAT TCAATCCATT GT AATAT GGAACT CACAAGAAT TAAGTATTGCCACT TC 
AGATTAAATACCCTAAGAGTGT T T T TGTGGAT GAGGGAAT G T T TTGAGGGAATGTAATGT 
T TTGAGGGAAT TCC TCAAAACAT TACAT TACCTCAAAATAT TCCCCCATC CAAACACAC T 
ATAAAGGATCCTAATCTAGTGTAGCCTTAGTTTTCTATTGAAGTGATTTGTGTTTAGAGT 
TTGGTCTGCATCTGCAACTTTGTTTTGTTGCTGTAATGCCATGACTATGAACACTGACAG 
AAAGGTCTCATAATATCGGTATCTATCAATTTTAG 

G T T CC C AG AT G ACC AAAAT G AT AGAAT CT G G AAACGGAAAGAAAC T T CAAC T CCAAC AT C BAG A17 
GTTCCCASATGACCAAAATGATAGAATCTGGAAA cDNS A17 

TGCCCTTCCACTGTCTTTCAATGTCAGCAATGTTGACCTCA2\AGACAGTGTCACACCTCC BAC A17 
TGCCCTTCCACTGTCTTTCAATGTCAGCAATGTTGACCTCAAAGACAGTGTcmC^CCTCC cDNS A17 

TCTACAAGTCCTACAAACAGCTCTTACTCACCCTGAGCGATTGGAGTTCGTCCATGATGG BAC A17 
TCTACAAGTCCTACAAACAGCTCTTACTC^ cDNS A17 

CCTCGAGACCGATGATTATGAATACTCTGTGTTTCTCCACTTTCTTGAACTAAATGGCAC BAC A17 
CCTCGAGACCGATGATTATGAATACTCTGTGTTTCTCCACTTTCTTGAACTAAATGGCAC cDNS Al 7 

T GTCAGAGCAGGACAAAGGGTGT T TGACATCTATC TAAACAATGAGAT TAAAAAGGAGAA BAC A17 
TGT CAGAGC AGGAC AAAGGGTGT TTGAC ATCT ATCT ftAACAATGAGATTAAAAAGGAGAA cDNS A17 

ATT T GATGTT T TGGC TGGAGGGTCCAAGAACAGT TACACTGCC T TGAACAT T TCAGCAAA BAC A17 
ATTTGATGTTTTGGCTGGAGGGTCCAAGAACAGTTACACTGCCTTGAACATTTCAGCAAA cDNS A17 

TGGAT CACTCAATATAACCTTAG TCAAGGCATC TGGATCTGAGTT TGGACCCCT TT TGAA BAC Al 7 
TGGAT CACTCAATATAACCTTAGTCAAGGCAT CTGGATCTGAGT T TGGACCCC T T TTGAA cDNS A17 

TGCC TATGAAATCC TGCAGGCAC GGTC G T GGAT TGAAGAGAC CAACCAAAAAGAT TGTAA BAC A17 
TGCCTATGAAATCCTGCAGGCACGGTCGTGGATTGAAGAGACCAACCAAA?\AGATT cDNS A17 



GTGTACAACTAAGAT TGCTAAATNGATGAAT T T TAT TAACCT T TAACCAAT T T T CTATTT 
CATTCTCCCTCTTACACTAACACTTTTTTTCCTTTCAG 



13 



T GGAAGT TAT T CAGAAGATGAGAGAAGAAC T G C T GC T GCACAAC CAAGAAAAT GAAGCAT BAC A17 

TGGAAGTTATTCAGAAGATGAGAGAAGAACTGCTGCTGCACAACCAAGAAAATGiiAGCAT cDNS All 

TGGAGAGT TGGAGT GGAGACCCT T GTAT GAT T TTCCCCTGGAAAGGAATAACATGTGATG BAC A17 

TGGAGAGT TGGAGT GGAGACCC T TG T ATG AT T TTCCCCTGGAAAGGAATAAC ATGTGATG cDNS AX 7 

ATTCAACTGGTTCATCTATTAT CACTAAGCTGTAAGTCCTTCCAC T TT T TAGGGTCT GT T BAC A17 

ATTCAACTGGTTCATCTATTATCACTAAGCT cDNS A17 
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Figure 11* {cotxt. 3/5) 

TAGAT T TGC T TAT T T GAGTT TATC TAT TGAAATAAACAC T TAT GACAC TGT T TGG&AGAG 
CTTATGAAAACi\ACTTATAGTTTATACGMJ^ 

AGAAATAGC TTATiUlGTiVAGAACT TATATGATAAGCGTTTATGCTATAAACGCTCAilTTT 
AACT G TT TATCCAAACAGGACTT TAATGCATCCATATGT GT TTACAAGT TTTGCTCTAT T 
TTCTGCAG 



14 



GGAT C T TT C TTCCAATAATCTCAAGGGAGCAATTCCT TCCATT GTCACTAAGATGACCAA BAC A17 

GGATCTTTCTTCCAATAATCTCAAGGGAGCAATTCCTTCCATTGTCACTAAGATGACCAA cDNS A17 

T TTACAAATAC Tg£AC TGCTTCTAAATCTCATTTAAA^CATCCACAT TATTTTTGTGGG BAC A17 

TTTACAAATACT cDNS A17 

AACAG TGGAC TG T T T T T ATC CAT T AGT TAATAATG T T G TAAC AT TTT1GT TATGCGCAG 15 

GAACCTGAGCCACAACCAGTTCGATATGTTATTCCCCTCGTTTCCACCGTCCTCCTTGCT BAC A17 

GAACCTGAGCCACAACCAGTTCGATATGTTATTCCCCTCGTTTCCACCGTCCTCCTTGCT cDNS Al 1 

GATAT CAT TfiJAAT TATCTCT TCT CATG T T TAACAif\AT GAACTAGGAT GATACT AAFATG BAC A17 

GATATCATT CDNS All 



AGCTTATAGCACTTCTATTATCCTTGTAAATGTTAACATAAACAAGCACATGCAGATTAT 
AGAAC TAAAAT AT GATAGAATAT G C T G TATATAG T CC CAAC TC ATGT T GGT TACATAAGG 
TAGATATTAGACAGTCTCAATGCTGAAACCAR.TATCTGTGTGCATGATGCATCTTAATCT 
T TAAAAGATTT TATGAT TAAGGCT GT TCT CTATGGCACT TCAG TAGGAGCAGTAACACAT 
CTATGTGAAA.GTTCCATATGAJ\ATCCTTGAGAAATATGTTTTGACATTATGTTTCATATA 
T TGC T GAAT T TCTTT CT TCACCAAGGT TCGAATCT TAGAGAA&T T TCCTACAGT TAT TAG 
T GCC AT TGAA&AAT TACT AC TAAAT C T T T T TCAAAT GATT GAT CT ACAT T TAT GGAT CAG 
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GGATCTTAGCTACAATGATCTTT CAGGATGGCTT CCAGAATCCATTATCT CACT GCCACA BAC A17 

GGATC T TAGCTACAAT GATC T T T CAGGAT GGCT T CCAGAATCCAT TAT CT CACT GCCACA cDNS Al 7 

T T TAA&AT CAT T GTAAGT T T TAT AT GT T GGCAT T C TAG T T CAT CCAT AT TAGGAAGC TAT BAC A17 

TTTAAAATCATT CDNS A17 



TT TCGAT TGT TGT ATAT T TT TAATAT ACT TCAT T T AT TCAGC T T GTAT T TGAT T TAT TTC 
CAG 



17 



ATAT T TTGGCTGCAATCCATCTATGAGTGACGAAGATACAACAAAGT T GAACAGTTCACT BAC A17 

ATAT T T TGGC TGCAAT CCAT CTAT GAGT GACGAAGAT ACAACAAAGT T GAACAG TT CACT cDNS Al 7 

AATCAATACAGAGTAT GAAGTATT AT TGATGCATAACAATAGAilGT TT TTAAAATAjlAAT BAC A17 

AAT CAAT AC AG A cDNS A17 



AAAATAAATCAAAATTTTACTCGTTTCTGTTTTCACAG 
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TTATGGGAGATGCAAAGCAAAAAAACCAAAGTTTGGACAAGTATTCGTGATTGGAGCTAT BAC A17 

TTATGGGAGATGCAAAGCAAAAAAACCAAAGTTTGGACAflGTATTCGTGATTGGAGCTAT CDNS Al 7 

TACAAG TGGATCACTT TTGATTACT T TGGC TGT TGGAATTC TAT T T TT T TGCCG TTATAG BAC A17 

TACAAGTGGATCACTTTTGATTACTTTGGCTGTTGGAATTCTATTTTTTTGCCGTTATAG cDNS A17 

ACACAAGTCAATTACTTTGGAAGGATTTGGAAAGACCTACCCAATGGCAACAAglAHGCA BAC A17 

ACACAAGT CAAT TAG T T T GGAAGGAT TT GGAAAGAC C TAC C CAAT GGCAACAA cDNS A17 



TT TTATCT TT TATAAFC TATTGTAACATATGTCTATC TGCAT TCTAATGTAACATAT T TG 
CTCT T TAGAC TAACA^ACAMAAC TATAGTCACATC ATTCTGCATAGACAATCTCAGATC 
ACAAGATCTAAGT GT TATCT TTA21CA.TAGAGC^TGGTT TACAAATACAGGGAATACAT TT 
TTATAACCCTGAGATTAA^TCA^CTTTTCTTATACAGTCAGACAGGATCTGCACATAATG 
T T QyiCAC^TG AT AA^T AAG AG&A&T AT AT T T;AAA£J\AACT AAC AT CC AG T GT GAAT T 
CC T AGATAT AT GGCTACAAA&AAAT C TAT AT T CCAT C T GTACAAT AACAAGAAT AT CAAT 
TGA&AAAAGAATAC TTATGA&CATGATAGATGATAT TAGTTTGTGTAGAG GATCAT TACT 



19 
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Figure 11. (cont. 4/5) 

ATGATGGAAGTGATGATTTTAGC^TGTTTGGTTTGGTCTAATCTA^ 
ATTCTCCAACAAAAACTJTGCAG 

ATATAATT T TCTCTTTGCCAAGGAAAGACGAT TTC TTCATAAAGTC TGTATGAG TTAAAC BAG A17 
AT AT AAT T T TCT C T TT GCCAAGCAAAGACGAT T T CT T C ATAAAGTC TGTATCAGT TAAAC cDNS A17 

CGTTCACTTTGGAGTATATAGAGCAGGCTACAGAACAGTACAAAACTTTAATAGGTGAAG BAG A17 
CGTTCACTT TGGAGTATATAGAGCAGGCTACAGAACAG TACAAAAC TTTAATAGGTGAAG cDNS A17 

GAGGAT TTGGCTCTGT T TACAGGGGCACTC TAGACGATGGTCAAGAAGTGGCAGTGAAAG BAC A17 
GAGGATTTGGCTCTGTTTACAGGGGCACTCTAGACGATGGTCAAGAAGTGGCAGTGAAAG cDNS A17 

T G C GGT CAT C C AC AT C AAC T C AG G GAAC C C G AGAAT T T GAT AAT G AGCffi AT AAT AT G C AT BAC A17 
TGCGGTCATCCACATCAACTCAGGGAACCCGAGAATTTGATAATGAG cDNS A17 

TATCACTTTATTAGAGC^CAAGATTCCCACAGAATGTTCCTAATTAAAGCTTTGGCCAT 110 

ATATATAGTAGTTTAAGAATTTGTTCGGTTCCTATCACACCATCTGATTGGTCTATGATG 

GATCATGCM 

CTAAACCTACTTTCAGCTATACAACATGAGAACCTGGTGCCTCTTCTGGGTTACTGTAAT BAC A17 
CTAAACCTACTTTCAGCTATACAACATGAGAACCTGGTGCCTCTTCTGGGTTACTGTAAT cDNS Al 7 

GAGTATGATCAACAAATTCTCGTGTATCCTTTCATGTCCM BAC Al 7 

GAGTATGATCAACAAATTCTCGTGTATCCTTTCATGTCCAATGGCTCTTTGCTAGATAGA cDNS A17 

C T ATACGg£AAATAT C T CACAAT T C T T T T AGCAGATAT GT T T TAATAC ACAGTAAT T C T G BAC A17 
CTATACG cDNS All 

AAAT T TAAT T TCATAACAG 111 

GGGAAGCAT CAAAGAGAAAAATAT TAGAC TGGCCAAC TAGAC TC TC TATT GCTC TCGGT G BAC A17 
GGGAAGCATCAAAGAGAAAAATATTAGACTGGCCAACTAGACTCTCTATTGCTCTCGGTG cDNS A17 

C AGCTCGAGGCAAGT ACCATGATGT TGTT T TACT TT T ACTAAGT TGTGC AT ATGTACT AA BAC Al 7 
CAGCTCGAG cDNS A17 

TT AAGCTT GCAT CAT AT CAGAGAGGAT AAAAC ATGAAATTGT ATGT AAAC AT TT CAT ATG 

AAAACATAAAAC TAAT T CAAAT GAACAATGAAAAAAGAGAAGCGT ATAC AAT GAT TATGT 

C T TACCAC AGGC GC AGGGAC T T GAAT ATAAGAT AC ATAT T GAT T T C TC TACT GAAC AT AT 

AAACT GGAGACAATCAC AAGCAAAATG AT G AGGCT AT ATTGAT TG ACAAGGACAATC AT A 

T TGTT T TACTGT GATTAGAACGATACAGTAAAGATATT TTAGAGGGAATACTGTTTAC TA 112 

CTATAAGAGTGACTGATAGTTCGGATGTTGCTAATGGAAATTGATAAGATGGTCAACATT 

GACTTAGTGAAATCTGTCTGACAACTAAGTATTTTGGCCCTCACGGATCTACTTTGGCAT 

AAAGAAAAACTTCTTGAAATGATGCATCCTAAAGTTCTTTATTCTTCACATAATAAACTC 

AGT TT T TGACCTAAGTGACT TCACTACAGTT T TCGTCCATAACATATT TAGGACACTAAA 

ATGATCATAAGT TTTC TGATTGAGCAG 

GTTTGGCATATCTTCACACATTTCCAGGACGTTCTGTAATACACAGGGACGTAAAATCGA BAC Al 7 
GT T TG GCAT AT C T T CACACAT T TC CAGGACG T TCT G TAATACACAG GGACGTAAAAT CGA cDNS Al 7 

GCAATATACTGCTGGATCAGAGCATGTGTGCTAAGGTTGCAGATTTTGGTTTCTCAAAAT BAC Al 7 
GCAATATACTGCTGGATCAGAGCATGT GTGC TAAGGTTGCAGATTT TGGT TTC TCAAAAT cDNS Al 7 

ATGCTCCTCAGGAAGG^GACAGTTATGT T TCCCT TGAAGTAAGAGGAACTGCAGGGTAT C BAC A17 
ATGCTCCTCAGGAAGGAGACAGTTATGTTTCCCTTGAAGTAAGAGGAACTGCAGGGTATC cDNS A17 

TGGAT CCT GAg^AAGTGAACT TAAAT C CGT CT TAAT CTGAAT GTCT CATACTGCTCTCTC BAC A17 
TGGATCCTGA cDNS A17 

ATCTATCACTTTCAGCAAAATATTCAATAC^TTTGC 

CAGAAAACTGAAGAAATGT T TCTGAT TATATTCCTAT TCTGAATCTGAAG TGTATAT TAA 
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Figure 11. (confc. 5/5) 

CCATGT GAAT T TGAAT GGGATGAAGAAGAATCTAACATACCTAGT CCCATTT GT T TCACA 
TCTTAC AGAGAGTAACCATGACATGAACAAAT T TCACAATAGC TGACGT T TATC CATATG 
CAG 



113 



GTAC TACAAAAC CCAGCAAT TAT C TGAAAAAAGTGAT G T T T TCAGC T T T GG TG T GGT TC T BAC A17 

GTACTACAAAACCCAGCAATTATCTGAAfWkAGTGATGTTTTCAGCTTTGGTGTGGTTCT cDNS A17 

T CT TGAAAT TGT AAGCGGACGGG&ACC T C TCAACAT A?y\GAGACC AAGGAT CGAGTGGAG BAC A17 

TCT TGAAATTGTAAGCGGACGGGAACC T CTCAACATAA&GAGACCAAGGATCGAGTGGAG cDNS A17 

CTTGGTTGAATGGSSATGATCCATGACCCCATTTTTTTTACAAATATAATTTTCAAGTGG BAC A17 

CTTGGTTGAATGG cDNS A17 



TTCATATATTTTAGTAGGTACTAACACATAACTTTCATGCAG 
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GCTAAACCATACATAAGAGCATCA2\AGGTGGATGAAAT TGTAGAT CCTGGCATCAAGGGA BAC A17 

GC TAAAC CATACATAAGAG CATCAAAG G TGGATGAAAT TG TAGAT CC TGGCATCAAGGGA cDNS A17 

GGATATCATGCAGAGGCATTGTGGAGAGTTGTGGAAGTAGCACTACAATGTCTAGAACCC BAC A17 

GGATAT CATGCAGAGGCAT TGTGGAGAGT TGTGGAAGTAGCACTACAATGTCTAGAACCC cDNS A17 

TACTCAACATATCGGCCATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTCATT BAC Al 7 

TACTC^C^TATCGGCCATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTC^TT cDNS Al 7 

ATTGAAAACAATGCATCTGAATACATGA^TCCATAGACAGCCTTGGAGGATCCAACCGC BAC A17 

ATTGAAAACAATGCATCTGAATACATGAAATCCATAGACAGCCTTGGAGGATCCAACCGC cDNS A17 

TAC T CAAT T GT TAT GGACAAAC GGGC GC T GC C T T CAAC TACAT C TACAGCAGAAT CAAC T BAC A17 

TACTCAATTGTTATGGACAAACGGGCGCTGCCTTCAACTACATCTACAGCAGAATCAACT cDNS Al 7 

ATCACAACCCAAACCTTGTCACACCCTCAACCGAGATAGTAAATGGGTCGATGGAATTCT BAC A17 

AT CACAACCCAAACC T T GT CACACCCT CAACCGAGAT AG TAAATGGGTCGATGGAAT T C T cDNS A17 

T T TGAT T TGTTTTTGATCAT T GCT T TAGTAATATC AC AT T T TAAATGGTAAAGGAGAAAA BAC A17 

TTTGATTTGTTTTTGATC^TTGCTTTAGTAATATCACATTTTAAATGGTAAAGGAGAA^ CDNS A17 

ATAC TACTTCTGAT TGTAT TTCCATCCACTC TATG T T TC T TGAAACTGAATC TC TCT T GC BAC A17 

ATAC TACTTC TGAT TGTATT TCCATCCAC TCTATGT T TC TTGAAACTGAATC TC TC TTGC CDNS Al 7 

T CAGC CC CAGT T T T TAT GGGT GAAAAGAATAAT T TGGGTCAAAT G CAAGTGAAAC CATAT BAC A17 

TCAGCCCCAGTTTTTATGGGTGAAAAGAATAATTTGGGTC^^ cDNS A17 

GGTGCATAA.TTTAAAAGCCATATCATATCATTTGCCAAGTCCAAAGTAAAAATTTCACAA BAC Al 7 

GGT GCATAAT TTAAAA^GCCATATCATATCATT TGCCAAGTCCAAA.GTAAAftAT T TCACAA cDNS A17 

ACTAGTTAGATTGCGATTTAGTCTATAGACACTTCAACAGAGCTATATACACTATGGTTG BAC A17 

ACTAGTTAGATTGCGATTTAGTCTATAGACACTTCAACAGAGCTATATACACTATGGTTG cDNS A17 

AC T TGCG AC TAATTCGCTCAAGC AGGAGGAAC AC AT ATATATGGGAAACACT TG TAGAAC BAC Al 7 

ACTTGCGA cDNS A17 

TAT T T TGT TTATAGAAATGAAAATAT T TTCCGT T T T TAATT TTTGTT T TCAT TAAGTT TG BAC A17 

AATTATT T T TACTTT GGT TCATACATGAT TCTAAAT TATGTT TGAAGTTATGGTAAGAAA BAC A17 

GTTGTCATTGCGAAAAAATACGGCCGCAATTGCCGCTGTGCATCTTCAGCTCAAGTGATT BAC A17 

GTCTCTTGGTGATTGTTCAACAGACCCTCAaCATTGCTCATCCTATCAGAACCCAACTTC BAC A17 

CAATAGGCATTTGTTCTACTTAAJ^GAAAATCGGAAGCTAGC BAC A17 

Nhel 
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Figure 12 . 



MMELQVIRIFRLVVAFVLCLCIFIRSASSATKGFESIACCADSNYTDPKTTLTYTTDH1W 
FS DKRS CRQI PEILFS HRSNKNVRKFEI YEGKRC YNLPT VKDQVYLI RGI FP FDSLNS S F 
YVSIGVTELGELRSSRLEDLEIEGVFRATKDYIDFCLLKEDVNPFISQIELRPLPEEYLH 
GFGTSVLKLISRNNLGDTNDDIRFPDDQNDRIWKRKETSTPTSALPLSFNVSNVDLKDSV 
TPPLQVLQTALTHPERLEFVHDGLETDDYEYSVFLHFLELNGTVRAGQRVFDIYLNNEIK 
KEKFDVLAGGSKNSYTALNISANGSLNITLVKASGSEFGPLLNAYEILQARSWIEETNQK 
DLEVIQKMREELLLHNQENEALESWSGDPCMIFPWKGITCDDSTGSSIITKLDLSSNNLK 
GAIPSIVTKMTNLQILNLSHNQFDMLFPSFPPSSLLISLDLSYNDLSGWLPESIISLPHL 
KSLYFGCNPSMSDEDTTKLNSSLINTDYGRCKAKKPKFGQVFVIGAITSGSLLITLAVGI 
LFFCRYRHKSITLEGFGKTYPMATNIIFSLPSKDDFFIKSVSVKPFTLEYIEQATEQYKT 
LIGEGGFGSVYRGTLDDGQEVAVKVRSSTSTQGTREFDNELNLLSAIQHENLVPLLGYCN 
EYDQQILVYPFMSNGSLLDRLYGEASKRKILDWPTRLSIALGAARGLAYLHTFPGRSVIH 
RDVKSSNILLDQSMCAKVADFGFSKYAPQEGDSYVSLEVRGTAGYLDPEYYKTQQLSEKS 
DVFS FGVVLLE I VS GRE PLNI KRPRI EWS LVEWAKP Y I RAS KVDE I VD PG I KGG YHAEAL 
WRVVEVALQCLEPYSTYRPCMVDIVRELEDALIIENNASEYMKSIDSLGGSNRYSIVMDK 
RALPSTTSTAESTITTQTLSHPQPR* 
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The vertical arrows show the postions of the introns 
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The name of the domain 

Signal peptide 

1. Leucine Rich Repeat (LRR) 
2, 

3. 

Transmembrane domain 
ATP binding site 

Serine-Threonine kinase active site 



t 

405429 aa 
430-454 aa 
455-476 aa 
523-543 aa 
601-623 aa 

717-729 aa 



Putative N-glycosylation sites 

Putative phosphorilation (autophosphorilation) sites 



43-46; 116-119; 257-260; 318-320; 322-325; 326-329; 
436-439; 499-502; 628-631; 673-676; 876-879 aa 



61-63; 65-67; 75-77; 78-80; 98-100; 133-135; 168-170; 
173-175; 193-195; 222-224; 495-497; 581-583; 
685-687; 776-778; 793-795; 855-857; 890-892 aa 



WO 02/102841 



PCT7HU02/00055 



21/60 
Figure 14 . 



TTCCAGGCCTAAAGTCAAACACCATATTTTAACAATATTCTTTCTTCTACAAGGGTATAA 
CTTTTATACAAGTTCACTATATTATAGGATTGATCAAGGTTCATTTTTTCTTTCTTTGAA 
AAATCTCTAAGGGGTGCTGTTTCCAAGGCAGAAAATGAAATAGAATTCAGAAGAATTTTT 
AT GGT AC T AAAG G GAAGAT GAG AAGT TAG T TAG CAT GG AT T CAAAT T T G AT AAC C C T T T G 
G GG T AAAAT CT C T T T C AG AT T AT AAT GG AG C T AC AAG T TAT T AGG AT AT T TAG AT T G G T T 
GTGGCATGTGTTCTTTGTTTGTGTATATTTATCAGATCAGCTTCTTCTGCAACTAAAGGG 
T T T GAG AG CAT AG CAT GTTGTGCT GAT T C C AAT TAG AC AG AT C C AAAAAC C AC C C T AAC T 
T AT ACAAC AGAT C ACAT CT GGT T CT C T GAT AAAAGAAGT T GCAGACC AAT ACC CGAAAT T 
T T G T T TAG C C AC AG AAG C AAT AAAAAT GT T C G AAT AT T T G AAAT AG AT G AAG G AAAG AG A 
TGTTATACTTTGCCAACAATTAAGGATCAAGTATATTTGATAAGGGGTGTATTTCCCTTT 
GATAGTTTAAATTCTTCGTTTTATGTTTATATCGGGGTAACAGAACTAGGTGAATTAAGA 
TCGTCTAGACTCGAGGACTTGGAAATTGAGGGAGTTTTTAGAGCCACCAAAGACTATATT 
GATTTCTGCTTATTGAAGGAGGATGTCAATCCCTTCATTTCTCAGATTGAATTGAGGCCA 
TTACCTGAAGAATACCTACATGGTTTCGCTACTAGTGTTTTAAAACTGATAAGCAGAAAC 
AAT C T T G G T G AC AT AAAT GAT GAT AT C AG G T T C C C AG AT G AC C G AAAT GAT AG AAT C T G G 
AAACGGAAAGCAACTTCAACTCCATCATCTGCCCTTCCACTGTCTTTCAATGTCAGCAAT 
GTTGACCTCAAAGACAGTGTCGCACCTCCTCTACAAGTCCTACAAACAGCTCTTACTCAC 
CCTGAGCGATTGGAGTTTGTCCATGATGGCCTCGAGACCGATGATTATGAATACTCTGTG 
TTTCTCCACTTTCTTGAACTAAATGGCACTGTCAGAGCAGGACAAAGGGTGTTTGACATC 
TATCTAAACAATGAGATTAAAAAGGAGAAGTTTGATGTTTTGGCTGGAGGGTCCAAGAAC 
AGTTACACTGCCTTGAACATTTCAGCAAATGGATCACTCAATATAACCTTAGTCAAGGCA 
TCTGGATCTGAGTTTGGACCCCTTTTGAATGCCTATGAAATCCTGCAGGCACGGTCGTGG 
AT T GAAG AG AC C AACC AAAAAG AT T T G G AAC T TAT T C AGAAG AC AAG AG AAG AAC T G C T G 
C TGCAC AACCAAGAAAAT GAAGCAT T GGAGAGT T G GAGT GGAGACCCT T GTAT GAT T T T C 
CCCTGGAAAGGAATAACATGTGATGATTCAACTGGTTCATCTATTATCACTATGCTGGAT 
CTTTCTTCCAATAATCTCAAGGGAGCAATTCCTTACTTTGTCACTAAGATGACCAATTTA 
CAAATACTGAACCTGAGCCACAACCAGTTCGATTCGTTATTCCCCTCGTTTCCACCGTCC 
TCCTTGCTGATATCATTGGATCTGAGCTACAATGATCTTGATGGACGGCTTCCAGAATCC 
AT TAT C T C AC T G C C AC AT T T AAAAT CAT TAT AT T T T G G C T GC AAT C CAT AT AT GAAG G AC 
GAAG AT AC AAC AAAG T T G AAC AGT T C AC T AAT C AAT AC AG AT T AT GGG AG AT G C AAAG G A 
AAAAAAC C AAAG T T T GG AC AAG TAT T C G T GAT T GG AG C T AT TAG AAG T G GAT C AC T T T T G 
ATTACTTTGGCTGTTGGAATTCTATTTTTTTGCCGTTATAGACACAAGTCAATTACTTTG 
GAAG GAT T T G G T GGAAAG AC C T AC C C AAT GG C AAC AAAT AT AAT CTTCTCTTTGC C AAG C 
AAAG AC GAT T T C T T CAT AAAG T C T G TAT C AG T T AAAC CAT T C AC T T T G GAG TAT AT AG AG 
CAGGCTACAGAACAGTACAAAACTTTGATATGTGAAGGAGGATTTGGTTCTGTTTACAGA 
GGCACTCTAGACGATGGTCAAGAAGTGGCAGTGAAAGTGCGGTCATCCACATCAACTCAG 
GGAACCAAGGAATTTGATAACGAGCTAAACCTACTTTCAGCTATACAACATGAGAACCTG 
GTGCCTCTTCTGGGTTACTGTAATGAGTATGATCAACAAATTCTCGTGTATCCATTCATG 
T C T AAT GGCTCTTTGC TAG AT AG AT TAT AC G GG G AAG CAT CAAAG AG AAAAAT AT TAG AC 
TGGCCAACTAGACTCTCTATTGCTCTCGGTGCACCTCGAGGTTTGGCATAGCTTCACACA 
T T T CC AG G ACC T T C T G T AAT AC AC AG G G AC C T AAAAT C G AGCAAT AT AC T G C T G GAT C AG 
AGCATGTGTGCTAAGGTTGCAGATTTTGGTTTCTCAAAATACGCTCCTCAGGAAGGAGAC 
AGTTATGTTTCCCTTGAAGTAAGAGGAACTGCAGGGTATCTGGATCCTGAGTACTACAAA 
ACCCAGCAATTATCTGAAAAAAGTGATGTTTTCAGCTTTGGTGTGGTTCTACTTGAAATT 
GTAAGTGGACGGGAACCTCTCAACATAAAGAGACCACGGATCGAGTGGAGCTTGGTTGAA 
TGGGCTAAACCATACATAAGAGCATCAAAGGTGGATGAAATTGTAGATCCTGGCATCAAG 
GG AG GAT AT CAT GC AG AAGC AT T G T G GAG AG T T G T G GAAG T AGC AC T G C AAT G T C T AG AA 
CCCTACTCAACATATAGGCCATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTC 
AT T A T T G AAAAC AAT G CAT C T G AAT AC AT G AAAT C CAT AG AC AG CC T T G GAG GAT C C AAC 
CGCTACTCAATTGTTATGGACAAACGGGCGCTGCCTTCAACTACATCTACAGCAGAATCA 
AC TAT C AC AACC C AAAC C T T G AC AC ACC C T C AAC C GAG AT AG T AAAT G G G T C GAT G G AAT 
TCTTTTGATTTGTTTTTGATCATTGCTTTAGTAATATCCCATTTTCAATGGTAAAGGAGA 
AAAATACTACTTTTGATTGTATTTTCATCCACTCTATGTTTCTTGAAACTGAATCTCTCT 
TGCTCAGCCCCAGTTTTTATGGGTGAAAAGAATAATTTGGGTCAAATGCAAGTGAAACCA 
TATGGTGCATAATTTGAAAGCCATATTATATCATTTGCCAAGTCCAAAGTAAAAATTTCA 
C AAAC TAG T TAG AT T G C GAT T TAG T C TAT AG AC AC T T C AAC AG AG C TAT AT AC AC TAT GG 
TTGACTTGCGACTAATTC 
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Figure 15 . 

CCATATTTTAACAATATTCTTTCTTCTACAAGGGTATAACTTTTATACAAGTTCACTATA 
TTATAGGATTGATCAAGGTTCATTTTTTCTTTCTTTGAAAAATCTCTAAGGGGTGTGGTT 
T C CAAGG C AG AAAAT G AAAT AG AAT G C AG AAG AAT T TG TAT GG T AC T AT AAAG G G AAG AT 
GAAAAGTTAGTTAGCATGGATTCAAGTTTGATAACCCTTTGGGGTAAAATCTCTTTCAGA 
TTATGATGGAGCTACAAGTTATTAGGATATTTAGATTGGTTGTGGCATGTGTTCTTTGTT 
TGTGTATATTTATCAGATCAGCTTCTTCTGCAACTAAAGGGTTTGAGAGCATATCATGTT 
G T G C T GAT TCC AAT T AC AC AGAT CCAAAAAC AACC C T AAC T T AT AC AAC AGAT C AC AT C T 
GGTTCTCT G AT AAAAG AAG T T G C AG AC C AAT AC C C G AAAT T T T G T T TAG C C AC AG AAG C A 
AT AAAAAT GT T C G AAT AT T T G AAAT AG AT G AAG G AAAG AG AT G T TAT AC T T T G C C AAC AA 
TTAAGGATCAAGTATATTTGATAAGGGGTGTATTTCCCTTTGATAGTTTAAATTCTTCGT 
T T TAT G T T TAT AT C G G G G T AAC AG AAC TAG G T G AAT T AAG AT C G T C TAG AC T C GAG G AC T 
TGGAAATCGAGGGAGTTTTTAGAGCCACCAAAGACTATATTGATTTCTGCTTATTGAAGG 
AAGATGTCAATCCCTTCATTTCTCAGATTGAATTGAGGCCATTACCTGAAGAATACCTAC 
AT G G T T T C GC T AC T AGT G T T T T AAAAC T GAT AAG C AG AAAT AAT C T T G G T G AC AC AAAT G 
AT GAT AT AAGGT T CCC AG ATGACCAAAAT G AT AGAATCT GGAAACGGAAAGC AACT T C AA 
CTCCATCATCTGCCCTTCCCCTGTCTTCCAATGTCAGCAATGTTGACCTCAAAGACAGTG 
TCACACCTCCTCTACAAGTCCTACAAACAGCTCTTACTCACCCTGAGCGATTGGAGTTCG 
T CC AT GAT GG C CT C GAG AC C GAT GAT TAT G AAT AC TCTGTGTTTCTC C AC T T T C T T G AAC 
T AAAT G G C AC T G T C AG AG C AG G AC AAAG G G T G T T T G AC AT C TAT C T AAA C AAT GAG AT T A 
AAAAGGAGAAGTTTGATGTTTTGGCTGGAGGGTCCAAGAACAGTTACACTGCCTTGAACA 
T T T C AG C AAAT G GAT C AC T C AAT AT AAC C T TAG T C AAG GC AT C T G GAT C T GAG T T T G G AC 
CCCTTTTGAATGCCTATGAAATCCTGCAGGCACGGTCGTGGATTGAAGAGACCAACCAAA 
AAG AT T T GGAAC T T AT T CAGAAG ATGAGAGAAGAACT GCT GCT GCACAACCG AG AAAAT G 
AAG CAT T GGAG AGT T GG AG T GG AG AC C C T T G TAT GAT T T T C C C C T GG AAAG G AAT AAC AT 
GTGATGATTCAACTGGTTCATCTATTATCACTATGCTGGATCTTTCTTCCAATAATCTCA 
AG G GAG C AAT T C C T T AC T T T G T C AC T AAG AT G AC C AAT T TAG AAAT AC T G AAC C T GAG C C 
ACAACCAGTTCGATTCGTTATTCCCCTCGTTTCCACCGTCCTCCTTGCTGATATCATTGG 
ATCTGAGCTACAATGATCTTGATGGACGGCTTCCAGAATCCATTATCTCACTGCCACATT 
T AAAAT CAT TAT AT T T T G G C T G C AAT C CAT AT AT G AAG G AC G AAG AT AC AAC AAAG T T G A 
ACAGT T CACT AAT C AAT AC AGAT TAT G GGAG AT G C AAAGGAAAAAAAC C AAAG T T T G G AC 
AAGTATTCGTGATTGGAGCTATTACAAGGGGATCACTTTTGATTACTTTGGCTGTTGGAA 
T T C TAT TTTTTTGCCGT TAT AG AC AC AAG T C AAT T AC T T T GG AAG GAT T T G G T G G AAAG A 
CCTACCCAATGGCAACAAATATAATCTTCTCTTTGCCAAGCAAAGACGATTTCTTCATAA 
AGTCTGTATCAGTTAAACCATTCACTTTGGAGTATATAGAGCAGGCTACAGAACAGTACA 
AAACTTTGATAGGTGAAGGAGGATTTGGTTCTGTTTACAGAGGCACTCTAGACGATGGTC 
AAGAAGTGGCAGTGAAAGTGCGGTCATCCACATCAACTCAGGGAACCCNAGAATTTGATA 
ATGAGCTAAACCTACTTTCAGCTATACAACATGAGAACCTGGTGCCTCTTCTGGGTTACT 
GTAATGAGTATGATCAACAAATTCTCGTGTATCCATTCATGTCCAATGGCTCTTTGCTAG 
AT AG AC T AT AC GG G G AAG CAT C AAAG AG AAAAAT AT TAG AC T GG C C AAC TAG AC T C T C T A 
TTGCTCTCGGTGCAGCTCGAGGTTTGGCATATCTTCACACATTTCCAGGACGTTCTGTAA 
TACACAGGGACGTAAAATCGAGCAATATACTGCTGGATCAGAGCATGTGTGCTAAGGTTG 
CAGATTTTGGTTTCTCAAAATACGCTCCTCAGGAAGGAGACAGTTATGTTTCCCTTGAAG 
TAAGAGGAACTGCAGGGTATCTGGATCCTGAGTACTACAAAACCCAGCAATTATCTGAAA 
AAAGTGATGTTTTCAGCTTTGGTGTGGTTCTACTTGAAATTGTAAGTGGACGGGAACCTC 
T C AAC AT AAAG AG AC C AC G GAT C GAG T G G AGC T T G G T T GAAT GG G C T AAAC CAT AC AT AA 
GAG CAT C AAAGG T G GAT G AAAT T G TAG AT C C T G G CAT C AAG GGAGG AT AT CAT G CAGAAG 
CATTGTGGAGAGTTGTGGAAGTAGCACTGCAATGTCTAGAACCCTACTCAACATATAGGC 
CATGCATGGTTGATATTGTCCGCGAGTTGGAGGATGCTCTCATTATTGAAAACAATGCAT 
CTGAATACATGAAATCCATAGACAGCCTTGGAGGATCCAACCGCTACTCAATTGTTATGG 
AC AAAC GGGCGCTGCCTT C AAC T AC AT C T AC AG C AG AAT C AAC TAT C AC AAC C C AAAC C T 
TGACACACCCTCAACCGAGATAGTAAATGGGTCGATGGAATTCTTTTGATTTGTTTTTTA 
T CAT T G C T T T AGT AAT AT CC C AT T T T AAAT G G T AAAG GAG AAAAAT AC T AC T T T T GAT T G 
TATTTTCATCCACTCTATGTTTCTTGAAACTG7\ATCTCTCTTGCTCAGCCCCAGTTTTTA 
TGGGTGAAAAGAATAATTTGGGTCAAATGCAAGTGAAACCATATGGTGCATAATTTGAAA 
GCCATATTATATCATTTGCTAAGTCCAAAGTAAAAATTTCACAAACTAGTTAGATTGCGA 
T T T AGT CT AT AC AC AC T T C AAC AGAG CT AT AT ACACT AT 
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Figure 16. 



NLFQIIMELQVIRIFRLVVACVLCLCIFIRSASSATKGFESIACCADSNYTDPKTTLTYT 
TDHIWFS DKRSCRP I PEILFS HRSNKNVR1 FEI DEGKRCYTLPTI KDQVYLIRGVFPFDS 
LNSSFYVYIGVTELGELRSSRLEDLEIEGVFRATKDYIDFCLLKEDVNPFISQIELRPLP 
EEYLHGFATSVLKLISRNNLGDINDDIRFPDDRNDRIWKRKATSTPSSALPLSFNVSNVD 
LKDSVAPPLQVLQTALTHPERLEFVHDGLETDDYEYSVFLHFLELNGTVRAGQRVFDIYL 
NNEIKKEKFDVLAGGSKNSYTALNISANGSLNITLVKASGSEFGPLLNAYEILQARSWIE 
ETNQKDLELIQKTREELLLHNQENEALESWSGDPCMIFPWKGITCDDSTGSSIITMLDLS 
SNNLKGAIPYFVTKMTNLQILNLSHNQFDSLFPSFPPSSLLISLDLSYNDLDGRLPESII 
SLPHLKSLYFGCNPYMKDEDTTKLNSSLINTDYGRCKGKKPKFGQVFVIGAITSGSLLIT 
LAVGILFFCRYRHKSITLEGFGGKTYPMATNIIFSLPSKDDFFIKSVSVKPFTLEYIEQA 
TEQYKTLICEGGFGSVYRGTLDDGQEVAVKVRSSTSTQGTKEFDNELNLLSAIQHENLVP 
LLGYCNEYDQQILVYPFMSNGSLLDRLYGEASKRKILDWPTRLSIALGAPRGLA^ 
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Figure 17 * 



NLFQIMMELQVIRIFRLVVACVLCLCIFIRSASSATKGFESIACCADSNYTDPKTTLTYT 
TDHIWFSDKRSCRPIPEILFSHRSNKNVRIFEIDEGKRCYTLPTIKDQVYLIRGVFPFDS 
LNSSFYVYIGVTELGELRSSRLEDLEIEGVFRATKDYIDFCLLKEDVNPFISQIELRPLP 
EEYLHGFATSVLKLISRNNLGDTNDDIRFPDDQNDRIWKRKATSTPSSALPLSSNVSNVD 
LKDSVTPPLQVLQTALTHPERLEFVHDGLETDDYEYSVFLHFLELNGTVRAGQRVFDIYL 
NNEIKKEKFDVLAGGSKNSYTALNISANGSLNITLVKASGSEFGPLLNAYEILQARSWIE 
ETNQKDLELIQKMREELLLHNRENEALESWSGDPCMIFPWKGITCDDSTGSSIITMLDLS 
SNNLKGAIPYFVTKMTNLQILNLSHNQFDSLFPSFPPSSLLISLDLSYNDLDGRLPESII 
SLPHLKSLYFGCNPYMKDEDTTKLNSSLINTDYGRCKGKKPKFGQVFVIGAITRGSLLIT 
LAVGILFFCRYRHKSITLEGFGGKTYPMATNIIFSLPSKDDFFIKSVSVKPFTLEYIEQA 
TEQYKTLIGEGGFGSVYRGTLDDGQEVAVKVRSSTSTQGTXEFDNELNLLSAIQHENLVP 
LLGYCNEYDQQILVYPFMSNGSLLDRLYGEASKRKILDWPTRLSIALGAARGLAYLHTFP 
GRSVIHRDVKSSNILLDQSMCAKVADFGFSKYAPQEGDSYVSLEVRGTAGYLDPEYYKTQ 
QLSEKSDVFSFGVVLLEIVSGREPLNIKRPRIEWSLVEWAKPYIRASKVDEIVDPGIKGG 
YHAEALWRVVEVALQCLEPYSTYRPCMVDIVRELEDALIIElslNASEYMKSIDSLGGSNRY 
SIVMDKRALPSTTSTAESTITTQTLTHPQPR* 
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Figure 18, (1/6) 



T T CC AGGCC T AAAGTC AAAC ACC AT AT T TTAAC AAT AT TCTT TCT TCT AC AGCTTTTATACAAGTT MtAl7 

TTCCAGGCCTAAAGTCAAACACCATATTTTAACAATA^ MslK- 

CCATATTTTAACAATATTCTTTCTTCTACAAGGGTATA>\CTTTTATACAAGTT Ms 6N+ 

MTTTTAACAATTTTTCTTTTTCTiUkTTTCTAAC^^ Ps 

CACTATATTATAGGATTGATCAGGGTTCATTTTTTCTTTCTTTGAAAAATCTCTAAGGGGTGCTGTTTCCAAGGCAGAG MtA17 

CACTATATTATAGGATTGATCAAGGTTCATTTTTTCTTT^ MslN- 

CACTATATTATAGGATTGATCAAGGTTCATTTTTTCTTTCTTTGAAAAATCT^ Ms6tt+ 

CAC&AAGGTAT&ACTTTTGTAC&T^^ Ps 

j^TG&AMAG^TT^ MtA17 

AATGAAATAGAATTCAGAAGAATTTTTATGGTACT — AAAGGGAAGATGAGAAGTTAGTTAGCATGGATTCAAATTTGA Ms IN- 

AAT GAAAT AGAAT GCAGAAGAAT TT GTAT GGTACTATAAAGGGAAGAT GAAAAGT TAGTTAGCAT GGATT CAAGT T T GA Ms6N+ 

T GAAAT AGAAT TCAGAAGAAT T T T TAT GGTACTATA — GGGAAGATGGAGAGTTAGTTAGCATGGATTCGAGTTTGA Ps 
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Figure 18. (confc. 2/6) 
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Figure 18. (cont. 3/6) 
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CTA 


TTT 


TTT 


TGC 


Vv 


GGA 


GCT 


ATT 


ACA 


AGT 


GGA 


TCA 


ATT 


TTG 


ATT 


ACT 


TTG 


GCT 


GTT 


GTA 


ATT 


CTA 


TTC 


TTT 


TGC 


Ps 


CGT 


TAT 


AGA 


CAC 


AAG 


TCA ATT 


ACT 


TTG 


GAA 


GGA 


TTT 


GG- 


— A 


AAG 


ACC 


TAC 


CCA 


ATG 


GCA 


MtA17 


CGT 


TAT 


AGA 


CAC 


AAG 


TCA 


ATT 


ACT 


TTG 


GAA 


GGA 


TTT 


GGT 


GGA 


AAG 


ACC 


TAC 


CCA 


ATG 


GCA 


MslN- 


CGT 


TAT 


AGA 


CAC 


AAG 


TCA 


ATT 


ACT 


TTG 


GAA 


GGA 


TTT 


GGT 


GGA 


AAG 


ACC 


TAC 


CCA 


ATG 


GCA 


Ms6N+ 


CGT 


TAT 


AGA 


CAC 


AAG 


TCA 


ATT 


ACT 


TTG 


GAA 


GGA 


TTT 


GGT 


GGA 


AAG 


ACC 


TAC 


CCG 


ATG 


GCA 


Vv 


CGT 


TAT 


AGA 


CAC 


AAG 


TCA 


ATT 


ACT 


TTG 


GAA 


GGA 


TTT 


GGT 


GGA 


AAG 


ACC 


TAC 


CCA 


ATG 


GCA 


Ps 
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Figure 18. (cont. 4/6) 



ACA 


A ±9 AT 


ATA ATT 


TTC 


TCT 


TTG 


CCA 


AGC 


AAA 


GAC 


GAT 


TTC 


TTC 


ATA 


AAG 


TCT 


GTA 


TCA 


MtA17 


ACA 


A 19 AT 


ATA 


ATC 


TTC 


TCT 


TTG 


CCA 


AGC 


AAA 


GAC 


GAT 


TTC 


TTC 


ATA 


AAG 


TCT 


GTA 


TCA 




ACA 


A 19 AT 


ATA ATC 


TTC 


TCT 


TTG 


CCA 


AGC 


AAA 


GAC 


GAT 


TTC 


TTC 


ATA AAG 


TCT 


GTA 


TCA 


MS6N+ 


ACA 


A i9 AT 


ATA 


ATT 


TTC 


TCC 


TTG 


CCA 


AGC 


AAA 


GAC 


GAT 


TTC 


TTC 


ATA 


AAG 


TCT 


GTA 


TCA 


Vv 


ACA 


A 19 AT 


ATA ATT 


TTT 


TCC 


TTG 


CCA 


AGC 


AAA 


GAC 


GAT 


TTC 


TTC 


ATA 


AAG 


TCT 


GTA 


TCA 


Ps 


GTT 


AAA 


CCG 


TTC 


ACT 


TTG 


GAG 


TAT 


ATA 


GAG 


CAG 


GCT 


ACA 


GAA 


CAG 


TAC 


AAA 


ACT 


TTA 


ATA 


MtA17 


GTT 


AAA 


CCA 


TTC 


ACT 


TTG 


GAG 


TAT 


ATA 


GAG 


CAG 


GCT 


ACA 


GAA 


CAG 


TAC 


AAA 


ACT 


TTG 


ATA 


HslN- 


GTT 


AAA 


CCA 


TTC 


ACT 


TTG 


GAG 


TAT 


ATA 


GAG 


CAG 


GCT 


ACA 


GAA 


CAG 


TAC 


AAA 


ACT 


TTG 


ATA 


Ms6N+ 


GTT 


AAA 


CCG 


TTC 


TCT 


TTG 


GAG 


TAT 


ATA 


GAG 


TTG 


GCA 


ACA 


GAG 


AAG 


TAC 


AAA 


ACT 


TTG 


ATA 


Vv 


GTT 


AAG 


CCA 


TTC 


ACT 


TTG 


GAG 


TAT 


ATA 


GAG 


TTG 


GCA 


ACA 


GAG 


AAG 


TAC 


AAA 


ACT 


TTG 


ATA 


Ps 


GGT 


GAA 


GGA 


GGA 


TTT 


GGC 


TCT 


GTT 


TAC 


AGG 


GGC 


ACT 


CTA 


GAC 


GAT 


GGT 


CAA 


GAA 


GTG 


GCA 


MtA17 


TGT 


GAA 


GGA 


GGA 


TTT 


GGT 


TCT 


GTT 


TAC 


AGA 


GGC 


ACT 


CTA 


GAC 


GAT 


GGT 


CAA 


GAA 


GTG 


GCA 


MslN- 


GGT 


GAA 


GGA 


GGA 


TTT 


GGT 


TCT 


GTT 


TAC 


AGA 


GGC 


ACT 


CTA 


GAC 


GAT 


GGT 


CAA 


GAA 


GTG 


GCA 


Ms6N+ 


GGT 


GAA 


GGA 


GGG 


TTT 


GGC 


TCT 


GTT 


TAC 


CGA 


GGC 


ACT 


CTA 


GAC 


GAT 


GGT 


CAA 


GAA 


GTG 


GCA 


Vv 


GGT 


GAA 


GGA 


GGA 


TTT 


GGC 


TCT 


GTT 


TAC 


CGG 


GGC 


ACT 


CTA 


GAC 


GAT 


GGA 


CAA 


GAA 


GTG 


GCA 


Ps 


GTG 


AAA 


GTG 


CGG 


TCA 


TCC 


ACA 


TCA 


ACT 


CAG 


GGA 


ACC 


CGA 


GAA 


TTT 


GAT 


AAT 


GAG 


110 


CTA 


MtA17 


GTG 


AAA 


GTG 


CGG 


TCA 


TCC 


ACA 


TCA 


ACT 


CAG 


GGA 


ACC 


AAG 


GAA 


TTT 


GAT 


AAC 


GAG 


ilO 


CTA 


Ms IN- 


GTG 


AAA 


GTG 


CGG 


TCA 


TCC 


ACA 


TCA 


ACT 


CAG 


GGA 


ACC 


CGA 


GAA 


TTT 


GAT 


AAT 


GAG 


110 


CTA 


Ms6N+ 


GTG 


AAA 


GTC 


CGG 


TCA 


GCC 


ACA 


TCA 


ACT 


CAG 


GGA 


ACC 


CGA 


GAA 


TTT 


GAC 


AAT 


GAG 


110 


CTA 


Vv 


GTG 


AAA 


GTC 


CGG 


TCA 


GCC 


ACA 


TCA 


ACT 


CAG 


GGA 


ACC 


CGA 


GAA 


TTT 


GAC 


AAT 


GAG 


ilO 


CTA 


Ps 


AAC 


CTA 


CTT 


TCA 


GCT 


ATA CAA 


CAT 


GAG 


AAC 


CTG 


GTG 


CCT 


CTT 


CTG 


GGT 


TAC 


TGT 


AAT 


GAG 


MtA17 


AAC 


CTA 


CTT 


TCA 


GCT 


ATA CAA 


CAT 


GAG 


AAC 


CTG 


GTG 


CCT 


CTT 


CTG 


GGT 


TAC 


TGT 


AAT 


GAG 


MslN- 


AAC 


CTA 


CTT 


TCA 


GCT 


ATA 


CAA 


CAT 


GAG 


AAC 


CTG 


GTG 


CCT 


CTT 


CTG 


GGT 


TAC 


TGT 


AAT 


GAG 


MS6NH- 


AAC 


CTA 


CTT 


TCA 


GCT 


ATA 


CAA 


CAT 


GAG 


AAC 


CTG 


GTG 


CCT 


CTT 


CTT 


GGT 


TAC 


TGT 


AAT 


GAA 


Vv 


AAC 


CTA 


CTT 


TCA 


GCT 


ATA 


CAA 


CAT 


GAG 


AAC 


CTG 


GTG 


CCT 


CTT 


CTT 


GGT 


TAC 


TGT 


AAT 


GAA 


Ps 


TAT 


GAT 


CAA 


CAA ATT 


CTC 


GTG 


TAT 


CCT 


TTC 


ATG 


TCC 


AAT 


GGC 


TCT 


TTG 


CTA 


GAT 


AGA 


CTA 


MtA17 


TAT 


GAT 


CAA 


CAA 


ATT 


CTC 


GTG 


TAT 


CCA 


TTC 


ATG 


TCT 


AAT 


GGC 


TCT 


TTG 


CTA 


GAT 


AGA 


TTA 


Ms IN- 


TAT 


GAT 


CAA 


CAA 


ATT 


CTC 


GTG 


TAT 


CCA 


TTC 


ATG 


TCC 


AAT 


GGC 


TCT 


TTG 


CTA 


GAT 


AGA 


CTA 


Ms 6N+ 


TAT 


GAT 


CAA 


CAA ATT 


CTT 


GTG 


TAG 


CCT 


TTC 


ATG 


TCT 


AAT 


GGA 


TCT 


TTG 


CTA 


GAT 


AGA 


CTA 


Vv 


TAT 


GAT 


CAA 


CAA ATT 


CTT 


GTA 


TAT 


CCT 


TTC 


ATG 


TCT 


AAT 


GGC 


TCT 


TTG 


CTA 


TAC 


AGA 


CTA 


Ps 



TAC 


G ill GG GAA GCA TCA AAG AGA AAA ATA 


TTA 


GAC 


TGG 


CCA 


ACT 


AGA 


CTC 


TCT 


ATT 


GCT 


MtAl7 


TAC 


G ill GG GAA GCA TCA AAG AGA AAA ATA 


TTA 


GAC 


TGG 


CCA 


ACT 


AGA 


CTC 


TCT 


ATT 


GCT 


MslN- 


TAC 


G ill GG GAA GCA TCA AAG AGA AAA ATA 


TTA 


GAC 


TGG 


CCA 


ACT 


AGA 


CTC 


TCT 


ATT 


GCT 


MS6N+ 


TAT 


G ill GG GAA CCA GCA AAG AGA AAA ATA CTA 


GAC 


TGG 


CCA ACT 


AGA 


CTC 


TCT 


ATT 


GCT 


Vv 


TAT 


G ill GG GAA CCG GCA AAG AGA AAA ATA 


CTA 


GAC 


TGG 


CCA ACT 


AGA 


CTC 


TCA ATT 


GCT 


Ps 


CTC 


GGT GCA GCT CGA G 112 GT TTG GCA TAT 


CTT 


CAC 


ACA 


TTT 


CCA 


GGA 


CGT 


TCT 


GTA 


ATA 


MtA17 


CTC 


GGT GCA CCT CGA G 112 GT TTG GCA TAG 


CTT 


CAC 


ACA 


TTT 


CCA 


GGA 


CCT 


TCT 


GTA 


ATA 


MslN- 


CTC 


GGT GCA GCT CGA G 112 GT TTG GCA TA!1>( 


5CTT 


CAC 


ACA 


TTT 


CCA 


GGA 


CGT 


TCT 


GTA 


ATA 




CTA 


GGA GCA GCT CGA G i!2 GT TTG GCA TAT 


\tt 


CAC 


ACA 


TTT 


CCT 


GGA 


CGG 


TCT 


GTA 


ATA 


Vv 


CTA 


GGA GCA GCT CGA G il2 GT TTG GCA TAT 


CTT 


CAC 


ACG 


TTT 


CCT 


GGA 


CGA 


TCT 


GTA 


ATA 


Ps 



mutation resulting in In frame STOP codon 



CAC 


AGG 


GAC 


GTA 


AAA 


TCG 


AGC 


AAT 


ATA 


CTG 


CTG 


GAT 


CAG 


AGC 


ATG 


TGT 


GCT 


AAG 


GTT 


GCA 


MtA17 


CAC 


AGG 


GAC 


CTA 


AAA 


TCG 


AGC 


AAT 


ATA 


CTG 


CTG 


GAT 


CAG 


AGC 


ATG 


TGT 


GCT 


AAG 


GTT 


GCA 


MslN- 


CAC 


AGG 


GAC 


GTA 


AAA 


TCG 


AGC 


AAT 


ATA 


CTG 


CTG 


GAT 


CAG 


AGC 


ATG 


TGT 


GCT 


AAG 


GTT 


GCA 


Ms6N+ 


CAC 


AGG 


GAC 


GTA 


AAA 


TCG 


AGC 


AAT 


ATA 


CTA 


CTG 


GAC 


CAT 


AGC 


ATG 


TGT 


GCT 


AAG 


GTC 


GCA 


W 


CAC 


AGG 


GAC 


GTA 


AAA 


TCG 


AGC 


AAT 


ATA 


CTA 


CTG 


GAC 


CAT 


AGC 


ATG 


TGT 


GCT 


AAG 


GTT 


GCA 


Ps 
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Figure 18. (cont. 5/6) 



GAT 


TTT 


GGT 


TTC 


TCA AAA 


TAT 


GCT 


CCT 


CAG 


GAA 


GGA 


GAC 


AGT 


TAT 


GTT 


TCC 


CTT 


GAA 


GTA 


MtA17 


GAT 


TTT 


GGT 


TTC 


TCA AAA TAC 


GCT 


CCT 


CAG 


GAA 


GGA 


GAC 


AGT 


TAT 


GTT 


TCC 


CTT 


GAA 


GTA 


MslN- 


GAT 


TTT 


GGT 


TTC 


TCA 


AAA 


TAC 


GCT 


CCT 


CAG 


GAA 


GGA 


GAC 


AGT 


TAT 


GTT 


TCC 


CTT 


GAA 


GTA 




GAT 


TTT 


GGT 


TTC 


TCA AAA 


TAC 


GCT 


CCT 


CAG 


GAA 


GGA 


GAC 


AGT 


TAT 


GTT 


TCC 


CTT 


GAA 


GTA 


Vv 


GAT 


TTT 


GGT 


TTC 


TCA AAA 


TAC 


GCT 


CCT 


CAG 


GAA 


GGA 


GAC 


AGT 


TAT 


GTT 


TCC 


CTT 


GAA 


GTA 


Ps 


AGA 


GGA 


ACT 


GCA 


GGG 


TAT 


CTG 


GAT 


CCT 


GAG 


TAC 


TAC 


AAA 


ACC 


CAG 


CAA 


TTA 


TCT 


GAA 


AAA 


MtA17 


AGA 


GGA 


ACT 


GCA 


GGG 


TAT 


CTG 


GAT 


CCT 


GAG 


TAC 


TAC 


AAA 


ACC 


CAG 


CAA 


TTA 


TCT 


GAA 


AAA 


MslN- 


AGA 


GGA 


ACT 


GCA 


GGG 


TAT 


CTG 


GAT 


CCT 


GAG 


TAC 


TAC 


AAA 


ACC 


CAG 


CAA 


TTA 


TCT 


GAA 


AAA 


M<* 6KT+ 


AGA 


GGA 


ACT 


GCA 


GGG 


TAT 


CTG 


GAT 


CCC 


GAG 


TAC 


TAC 


AAA 


ACC 


CAG 


CAA 


TTA 


TCT 


GAA 


AAG 


Vv 




GGA 


ACC 


GCA 


GGA 


TAT 


CTG 


GAT 


CCT 


GAG 


TAC 


TAC 


AAA 


ACC 


CAG 


CAA 


TTA 


TCT 


GAA 


AAG 


Ps 


AGT 


GAT 


il3 


GTT 


TTC 


AGC 


TTT 


GGT 


GTG 


GTT 


CTT 


CTT 


GAA 


ATT 


GTA 


AGC 


GGA 


CGG 


GAA 


CCT 


MtA17 


AGT 


GAT 


il3 


GTT 


TTC 


AGC 


TTT 


GGT 


GTG 


GTT 


CTA 


CTT 


GAA 


ATT 


GTA 


AGT 


GGA 


CGG 


GAA 


CCT 


MslN- 


AGT 


GAT 


113 


GTT 


TTC 


AGC 


TTT 


GGT 


GTG 


GTT 


CTA 


CTT 


GAA 


ATT 


GTA AGT 


GGA 


CGG 


GAA 


CCT 


Ms 6N+ 


AGT 


GAT 


113 


GTT 


TTT 


AGT 


TTT 


GGC 


GTG 


GTT 


CTA 


CTA 


GAA 


ATT 


GTA AGT 


GGC 


CGG 


GAA 


CCT 


Vv 


AGT 


GAT 


113 


GTT 


TTC 


AGC 


TTT 


GGC 


GTG 


GTT 


CTA 


CTA 


GAA 


ATT 


GTA AGT 


GGC 


CGG 


GAA 


CCT 


Ps 


CTC 


AAC 


ATA 


AAG 


AGA 


CCA 


AGG 


ATC 


GAG 


TGG 


AGC 


TTG 


GTT 


GAA 


TGG 


114 


GCT 


AAA 


CCA 


TAC 


MtA17 


CTC 


A AC 


ATA 


AAG 


AGA 


CCA 


CGG 


ATC 


GAG 


TGG 


AGC 


TTG 


GTT 


GAA 


TGG 


114 


GCT 


AAA 


CCA 


TAC 


Ms IN- 


CTC 


AAC 


ATA 


AAG 


AGA 


CCA 


CGG 


ATC 


GAG 


TGG 


AGC 


TTG 


GTT 


GAA 


TGG 


il4 


GCT 


AAA 


CCA 


TAC 


KS 6N+ 


CTC 


AAC 


ATA 


AAG 


AGA 


CCA 


CGA 


GTC 


GAG 


TGG 


AGC 


TTG 


GTT 


GAA 


TGG ±14 


GCT 


AAA 


CCA 


TAC 


Vv 


CTC 


AAC 


ATA 


AAG 


AGA 


CCA 


CGA 


GTC 


GAG 


TGG 


AGC 


TTG 


GTT 


GAA 


TGG 


xl4 


GCT 


AAA 


CCA 


TAT 


Ps 


ATA 


AGA 


GCA 


TCA 


AAG 


GTG 


GAT 


GAA 


ATT 


GTA 


GAT 


CCT 


GGC 


ATC 


AAG 


GGA 


GGA 


TAT 


CAT 


GCA 


MtA17 


ATA 


AGA 


GCA 


TCA 


AAG 


GTG 


GAT 


GAA 


ATT 


GTA 


GAT 


CCT 


GGC 


ATC 


AAG 


GGA 


GGA 


TAT 


CAT 


GCA 


Ms IN- 


ATA 


AGA 


GCA 


TCA 


AAG 


GTG 


GAT 


GAA 


ATT 


GTA 


GAT 


CCT 


GGC 


ATC 


AAG 


GGA 


GGA 


TAT 


CAT 


GCA 


KS 6N+ 


ATA 


AGA 


GCA 


TCA 


AAG 


GTG 


GAT 


GAA 


ATT 


GTA 


GAT 


CCT 


GGC 


ATC 


AAG 


GGA 


GGA 


TAT 


CAT 


GCA 


Vv 


ATA 


AGA 


GCA 


TCA 


AAG 


GTG 


GAT 


GAA 


ATT 


GTA 


GAT 


CCT 


GGC 


ATC 


AAG 


GGA 


GGA 


TAT 


CAT 


GCA 


Ps 


GAG 


GCA 


TTG 


TGG 


AGA 


GTT 


GTG 


GAA 


GTA 


GCA 


CTA 


CAA 


TGT 


CTA 


GAA 


CCC 


TAC 


TCA 


ACA 


TAT 


MtA17 


GAA 


GCA 


TTG 

JL JL VJ 


TGG 


AGA 


GTT 


GTG 


GAA 


GTA 


GCA 


CTG 


CAA 


TGT 


CTA 


GAA 


CCC 


TAC 


TCA 


ACA 


TAT 


Ms IN— 


GAA 


GCA 


TTG 


TGG 


AGA 


GTT 


GTG 


GAA 


GTA 


GCA 


CTG 


CAA 


TGT 


CTA 


GAA 


CCC 


TAC 


TCA 


ACA 


TAT 


Ms 6N+ 


GAG 


GCA 


CTG 


TGG 


AGA 


GTG 


GTG 


GAA 


GTA 


GCA 


CTG 


CAA 


TGT 


CT 














Vv 


GAG 


GCA 


CTG 


TGG 


AGA 


GTT 






























Ps 


CGG 


CCA 


TGC 


ATG 


GTT 


GAT 


ATT 


GTC 


CGC 


GAG 


TTG 


GAG 


GAT 


GCT 


CTC 


ATT 


ATT 


GAA 


AAC 


AAT 


MtA17 


AGG 


CCA 


TGC 




GTT 


GAT 


ATT 


GTC 


CGC 


GAG 


TTG 


GAG 


GAT 


GCT 


CTC 


ATT 


ATT 


GAA 


AAC 


AAT 


Ms 1N- 


AGG 


CCA 


TGC 


ATG 


GTT 


GAT 


ATT 


GTC 


CGC 


GAG 


TTG 


GAG 


GAT 


GCT 


CTC 


ATT 


ATT 


GAA 


AAC 


AAT 


MS6N+ 


GCA 


TCT 


GAA 


TAC 


ATG 


AAA 


TCC 


ATA 


GAC 


AGC 


CTT 


GGA 


GGA 


TCC 


AAC 


CGC 


TAC 


TCA 


ATT 


GTT 


MtA17 


GCA 


TCT 


GAA 


TAC 


ATG 


AAA 


TCC 


ATA 


GAC 


AGC 


CTT 


GGA 


GGA 


TCC 


AAC 


CGC 


TAC 


TCA 


ATT 


GTT 


MslN- 


GCA 


TCT 


GAA 


TAC 


ATG 


AAA 


TCC 


ATA 


GAC 


AGC 


CTT 


GGA 


GGA 


TCC 


AAC 


CGC 


TAC 


TCA 


ATT 


GTT 


Ms6N-f- 


ATG 


GAC 


AAA 


CGG 


GCG 


CTG 


CCT 


TCA 


ACT 


ACA 


TCT 


ACA 


GCA 


GAA 


TCA ACT 


ATC 


ACA 


ACC 


CAA 


MtA17 


ATG 


GAC 


AAA 


CGG 


GCG 


CTG 


CCT 


TCA 


ACT 


ACA 


TCT 


ACA 


GCA 


GAA 


TCA 


ACT 


ATC 


ACA 


ACC 


CAA 


MslN- 


ATG 


GAC 


AAA 


CGG 


GCG 


CTG 


CCT 


TCA 


ACT 


ACA 


TCT 


ACA 


GCA 


GAA 


TCA ACT 


ATC 


ACA 


ACC 


CAA 


Ms6N+ 



ACC TTG TCA CAC CCT CAA CCG AGA TAG TAA ATGGGTCGA — TGGAATTCTTTTGATTTGTTTTTGATCATT MtAl7 
ACC TTG ACA CAC CCT CAA CCG AGA TAG TAA ATGGGTCGA — TGGAATTCTTTTGATTTGTTTTTGATCATT MslN- 
ACC TTG ACA CAC CCT CAA CCG AGA TAG TAA ATGGGTCGA — TGGAATTCTTTTGATTTGTTTTTTATCATT Ms6N+ 
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Figure 18 . (cont. 6/6) 

GCTTTAGTAMATCACATTTTAAATGGTAAAGGAGAAAAATACT^ MtA17 

GCTTTAGTAATATCCCATTTTCAATGGTAAAGGAGAAAAAT^^ MslN- 

GCTTTAGTAATATCCCATTTTA^TGGTAAkGGAGA^ Ms6tf4- 

T GA^CTGAATCTCTCTTGCTCAGCCCCAGTT TTTAT GGGTGAAAAGAATAATTTGGGTCAAATGCAAGTGAAACCATA MtAl7 

TGA^CTGAATCTCTCTTGCTCAGCCCCAGTTTTTATGGGTGA^^^ MslN-^ 

TGi^CTGAATCTCTCTTGCTCAGCCCCAGTTTTTATGGGT^ Ms6N+ 

TGGTGCATAATTTAAA&GCCATATCATATCA^ MtA17 

TGGTGCATAATTTGA2U^GCCATATTATATCATTTGC MslN- 

TGGTGCATA&TTTG&AAGCCATATTATATCATTTGCT Ms6N+ 

TAG5CTATAGACACTTCAACAGAGCTATATACACTA0!GGTTGACTTGCGA MtAl7 

T AGTCT AT AGAC ACTTCAAC AGAGCT AT ATAC ACT ATGGTTGACT TGCGACT AAT TC MslN- 

TAGTCTATACACACTTCAACAGAGCTATATACACTAT Ms 
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Figiare 19. (1/2) 

MMELQVIRIFRLVVAFVTXLC I FIRSAS SATKGFES IACCADSNYTDPKTTLTYTTDHIW MtA17 
IMELQVIRI FRLVVACVLCLC I FIRSAS SATKGFES IACCADSNYTDPKTTLTYTTDHIW MslN- 
MMELQVIRI FRLVVACVLCLC I FIRSAS SATKGFES IACCADSNYTDPKTTLTYTTDHIW Ms 6N+ 
MELRVICI IRLVVACVLCLCI FIRSAS SATEGFES IACCADS P s 

FSDKRSCRQIPEILFSHRSNKNVRKFEIYEGKRCYMLPTVKDQVYLIRGIFPFDSLNSSF MtA17 
FS DKRS CRP I PE I L FS HRSNICNA/R I FE I DE GKRC YTL P T I KDQVYL I RGV FP FDS LNS S F Ms 1N- 
FSDKRSCRP I PE ILFSHRSNKNVRI FE IDEGKRCYTLPT IKDQVYLIRGVFPFDSLNSS F Ms 6N+ 

YVSIGVTELGELRSSRLEDLEIEGVFRATKDYIDFCLLKEDVNPFISQIELRPLPEEYLH MtA17 
YVYIGVTELGELRSSRLEDLEIEGVFRATKDYIDFCLLKEDVNPFISQIELRPLPEEYLH Ms 1N- 
YVY I GVTELGELRS SRLEDLE I E GVFRATKD YI DFCLLKEDVNPFI S Q I E LRPLPEE YLH Ms 6N+ 

GI DFCLLKEDANPFI SQLELRPLPEEYMH Vv 

GFGTSVlKLISRIsnsrLGDTO MtAl7 
G FAT S VLKL I S RNNLGD INDD I R FPDDKMDR I WKRKAT S T P S SAL P L S FNVSNVDLKDS V Ms 1N~ 
G FAT S VLKL I SRNNLGDTNDD IRFPDDQNDRI WKRKAT S T P S S ALPLS SNVSNVDLKDS V Ms 6N+- 
DFSTSVLKLIHRNNLCGIEDDIRFPVDQ1TORIWK — ATSTPS YAVPLS FNVSDVDLNGKV Vv 

TPPLQVLQTALTHPERLEFVHDGLETDDYEYSVFLHFLELNGTVRAGQRVFDI YLNNEIK MtAl 7 
APPLQVLQTALTHPERLEFVHDGLETDDYE YS VFLHFLELNGTVRAGQRVFDI YLNNEIK Ms 1N~ 
T PPLQ VLQTALTHPERLE FVHDGLE TDDYE YSVFLHFLELNGTVRAGQRVFDI YLNNE I K Ms 6N+ 
TPPRQVLQTALTHPDRLVFVHDGLETDDYEYSVLLYFLELNNTV^ Vv 

KEKFDVLAGGSKNS YTALNI SANGSLNI TL VKASGSE FGPLLNAYE ILQARSWIEETNQK MtAl 7 
KEKFDVLAGGSKWS YTALNI SANGSLNI TLVKASGSE FGPLLNAYE I LQARS WIEETNQK MslN- 
KE KFDVLAGGS KMS YTALNI SANGS LNI T L VKAS GS E FG PL LNAYE ILQARSWIEETNQK Ms 6M4- 
KES FDVSEGGSKYS YITLNI SANGSLNI TL2VKASGSK FGPLLNAYE ILQARPWIDETDQT Vv 
FDVLEGGSKYSYTVLNI SANGSLNI TL WASGSKFGPLLNAYE ILQARPWIDETDQT Ps 

DLE VI QKMREELLLHNQENEALES WS GDPCMI FPWKG I TCDDS TGS S 1 1 TKLDLS SNNLK MtAl 7 
DLELI QKTREELLLHNQENEALESWSGDPCMI FPWKG I TCDDS TGS 511 TMLDLS SNNLK MslN- 
DLEL I QKMREELLLHNRENEALE SWS GDPCMI FPWKG I TCDDS TG S S 1 1 TMLDLS SNNLK Ms 6N4- 
DVE VIQKLRKELLLQNQDNEALES WSGDPCMLFPWKGVACDGSNGS S VITKLDLS S SNLK Vv 
DLEVIQKMRKELLLQNQDNEALES^SGDPCMLFPWKGVACDGSNGS SVI TKLDLS S SNLK Ps 

GAI PS I VTKMTNLQILNLSHNQFDMLFPS FPPS SLLI SLDLS YNDLSGWLPES 1 1 SLPHL MtAl 7 
GAIPYFVTKMTNLQILNLSHNQFDSLFPSFPPSSLLI SLDLSYNDLDGRLPESI ISLPHL MslN- 
GAIPYFVTKMTNLQILNLSHNQFDSLFPSFPPSSLLISLDLSYNDLDGRLPESIISLPHL Ms6N+ 
GTIPSSVTEMTKLQILNLSHNHFDGYIPSFPSSSLLISVDLSYNDLTGQLPESIISLPHL Vv 
GTIPSSVTEMTKLQILNLSHNHFDGYIPSFPPSSLLISVDLSYNDLTGQLPESIISLPHL Ps 
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Figure 19. (2/2) 

K3LYFGCNPSMSDEDTTKLNS SL INTDYGRCKAKKPKFGQVFVI GAI TSG3LLI TLAVGI MtAl 7 
KSLY FGCNP YMKDEDTTKLNS S L INTDYGRCKGKKPKFGQVFV I GAI T S GSLL I TLAVGI MslN- 
KSL YFGCNP YMKDEDTTKLNS S L INTDYGRCKGKKPKFGQVFVI GAI TS GSLL I TLAVGI Ms 6N+ 
NSLYFGCNQHMSDDDEMCLNSSL IS TDYGRCKZVKS PKFGQVFMI GAI TSGS ILI TLAVGI Vv 
NSLYFGCNQHMRDDDEAKLNS SL INTDYGRCMAKKPKFGQVFMI GAI TSGS I LI TLAWI Ps 

LFFCRYRHKS I TLEGFG-KTYPMATNI I FSLPSKDDFFIKSVSVKPFTLEYIEQATEQYK MtAl 7 
LFFCRYRHKS ITLEGFGGKTYPMATNI I FSLPSKDDFFIKSVSVKPFTLEYIEQATEQYK MslN- 
LFFCRYRHKS ITLEGFGGKTYPMATNI I FSLPSKDDFFIKSVSVKPFTLEYIEQATEQYK Ms6N+ 
LFFCRYRHKS ITLEGFGGKTYPMATNI I FSLPSKDDFFIKSVSVKPFSLEYIELATEKYK Vv 
LFFCRYRHKS ITLEGFGGKTYPMATNI I FSLPSKI)DFFIKSVSVKPFTLEYIELATEKYK Ps 

TL I GE GGFGS VYRGTLDDGQEVAVKVRS S T S TQGTREFDNELNLLS AIQHENLVPLLGYC MtAl 7 
TLI CEGGFGS VYRGTLDDGQEVAVKVRS S TS TQGTKEFDNELNLLSAIQHENLVPLLGYC MslN- 
TL I GE GG FGS VYRGTLDDGQEVAVKVRS S T S TQG TXE FDNELNLLS AI QHENLVPLLG YC Ms 6N+ 
TLI GEGGFGS VYRGTLDDGQEVAVKVRS AT S TQGTRE FDNELNLL S AIQHENLVPLLGYC Vv 
TL I GE GGFGSVYRGTLDDGQEVAVKVRSAT S TQGTREFDNELNLL S AI QHENLVPLLGYC Ps 

NE YDQQ I LVY P FMSNGS LL DRL YGEAS KRKI LDWPT RLS I ALGAARGLAYLHT FPGRS V I MtAl 7 
NE Y DQQI LVY P FMS NGS LLDRL Y GEAS KRKI LDW P TRL S I AL GAPRGLA* Ms 1N- 

l^YDQQILVYPFMSNGSLLmLYGEASKRKILDWPTRLSIALGAARGLAYLHTFPGRSV^ Ms 6N-f 
NE YDQQ I LWP FMSNGS LLDRL YGEPAKRKILDWP TRL SI ALGAARGLAYLHT FPGRS VI Vv 
NE YDQQ I LW P EM S NGS LLDRL Y GEPAKRK I LDWP TRL SI ALGAARGLAYLHT FPGRS VI Ps 

HRDVKS SNILLDQSMCAKVADFG FSKYAPQEGDS YVS LEVRGTAGYLDPE YYKT QQLS EK MtAl 7 
HRDVKS SNI LLDQSMCAKVADFGFSKYAPQEGDS YVSLE VRGTAGYLDPEYYKT QQLSEK Ms 6N+ 
HRDVKS SN ILLDHSMCAKVADFG FSKYAPQEGDS YVSLEVRGTAGYLDPE YYKT QQLS EK Vv 
HRDVKS SNI LLDHSMCAKVADFGFSKYAPQEGDS YVSLEVRGTAG YLDPEYYKT QQLSEK Ps 

SDVFS FGWLLE I VSGREPLNIKRPRI EW3LVEWAKPY IRA5KVDE I VDPGIKGGYHAEA MtAl 7 
S DVFS FG WLLE I VSGREPLNI KRPRI E WS LVE WAKP Y I RAS KVDE I VDPGIKGG YHAEA Ms 6N+ 
SDVFSFGVVLLEIVSGREPLNIKRPRVEWSLVEWAKPYIRASKVDEIVDPGIKGGYHAEA Vv 
SDVFS FGWLLE I VSGREPLN I KRPRVE^ SLVEWAKPY I RASKVDE I VDPGI KGGYHAEA Ps 

LWRVVEVALQCLEPYSTYRPCMVDIVRELEDALI IENNASEYMKS I DSLGGSNRYS I VMD MtAl 7 
LWRWEVALQCLEPYSTYRPCMVDIVRELEDALI IENNASEYMKS I DSLGGSNRYS I VMD Ms 6N+ 



LTflRWEVALQCL Vv 

LWRV Ps 

KRALPSTTSTAESTITTQTLSHPQPR" - MtAl 7 

KRALP5TT5TAESTITTQTLTHPQFR* * Ms 6N+ 
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Figure 20. 
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Figure 21. 
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Figure 22. 
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Figure 23. 
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Figure 24. 
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Figure 25. 
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Figure 26. 
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L. j. = jLo/ms japonicus cv. funakura 

soybean = Glycine max cv. Williams 
pea = Pisum sativum cv. ujniajori 

bean = Phaseolus vulgaris cv. Juliska 

alfalfa = Medicago saliva ssp. saliva (EcoRI digestion) 
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Figure 27. 
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rice = Oryza sativa 

tobacco = Nicotiana tabacum cv. Small Havanna SRI 
maize = Zea mays 

wheat = Triticum aestivum 

L. j. = Lotus japonicus cv. funakura 
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Figure 28. 



TTTTTTAACAATTTTTCTTTTTCTAATTTCT7\ACCCTCCATCAAATTGAGAACTTTTGTA 
AAAT G G T T T CAAAT CCT T T C AC AAAGGT AT AAC T T T T GT ACAT G T T TAG TAT AT T AG AGG 
AT T GAT C AAG TTCCCTTT G AAAAAT C T C TAG G G G G T G AAA T AG AAT T C AG AAG AAT T T T T 
ATGGTACTATAGGGAAGATGGAGAGTTAGTTAGCATGGATTCGAGTTTGAGAACCCTTTG 
GGGTAACCTATCTTTCGGATTATGATGGAGCTACGAGTTATTTGTATAATAAGATTGGTT 
GTGGCGTGTGTTCTTTGTCTGTGTATATTTATCAGATCAGCTTCTTCTGCAACTGAAGGG 
TTTGAGAGCATAGCATGCTGTGCTGATTCGAANNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNN]SINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN1SINN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNMNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNTTTGATGTATTGGAAGGAGGATCCAAGTACAGTTACACT 
GTCTTGAACATTTCAGCAAATGGATCACTCAATATAACCTTAGTCAAGGCATCTGGATCT 
AAGTTTGGACCCCTTTTGAATGCCTATGAGATCCTGCAGGCGCGACCATGGATCGATGAG 
AC C G AC C AAAC AG AT C T G G AAG T T AT T C AG AAG AT G AG AAAAG AAC TGCTGCTG C AAAAC 
C AAG AC AAT G AAG CAT T G G AGA G T T GG AGT G GAG AT C C T T G T AT GC T T T T T C CAT G G AAA 
GGAGTAGCATGTGATGGTTCAAATGGTTCGTCTGTCATCACTAAGCTGGATCTTTCCTCC 
AGTAATCTCAAAGGAACAATTCCTTCCAGTGTCACTGAGATGACCAAATTACAAATACTG 
AACCTGAGCCACAACCATTTCGATGGTTATATCCCCTCATTTCCACCGTCCTCCTTGTTG 
AT AT C AG TAG AT C T G AGC T AC AAT GAC C T AAC GGGAC AG C T T C C AGAAT C CAT T AT C T C A 
C T G CC AC AT T T AAAC T CAT TAT AT T T T G G C T GC AAT C AAC AC AT GAG AG AC GAT GAT G AA 
GCC AAAT T GAACAGT TCACTAAT C AAT ACAGAT T ATGGGAGAT GT AAT G C AAAAAAAC C C 
AAAT T T G GAC AAG TAT T CAT GAT T G GAG C T A T T AC AAG T G GAT C AAT T T T GAT T AC T T T G 
GCTGTTGTAATTCTATTCTTTTGCCGTTATAGACACAAGTCAATTACTTTGGAAGGATTT 
G G T GGAAAG AC C T ACCCAAT GG C AAC AAAT AT AAT TTTTTCCTTGC C AAG C AAAGACG AT 
T T C T T C AT AAAG T C T G TAT C AG T T AAG C CAT T C AC T T T G GAG T AT AT AG AG T T G G C AAC A 
GAGAAGTACAAAACTTTGATAGGTGAAGGAGGATTTGGCTCTGTTTACCGGGGCACTCTA 
GACGATGGACAAGAAGTGGCAGTGAAAGTCCGGTCAGCCACATCAACTCAGGGAACCCGA 
GAATTTGACAATGAGCTAAACCTACTTTCAGCTATACAACATGAGAACCTGGTGCCTCTT 
CTTGGTTACTGTAATGAATATGATCAACAAATTCTTGTATATCCTTTCATGTCTAATGGC 
TCTTTGCTAGACAGACTATATGGGGAACCGGCAAAGAGAAAAATACTAGACTGGCCAACT 
AGACTCTCAATTGCTCTAGGAGCAGCTCGAGGTTTGGCATATCTTCACACGTTTCCTGGA 
CGATCTGTAATACACAGGGACGTAAAATCGAGCAATATACTACTGGACCATAGCATGTGT 
GCTAAGGTTGCAGATTTTGGTTTCTCAAAATACGCTCCTCAGGAAGGAGACAGTTATGTT 
TCCCTTGAAGTAAGAGGAACCGCAGGATATCTGGATCCTGAGTACTACAAAACCCAGCAA 
TTATCTGAAAAGAGTGATGTTTTCAGCTTTGGCGTGGTTCTACTAGAAATTGTAAGTGGC 
CGGGAACCTCTCAACATAAAGAGACCACGAGTCGAGTGGAGCTTGGTTGAATGGGCTAAA 
CCATATATAAGAGCATCAAAGGTGGATGAAATTGTAGATCCTGGCATCAAGGGAGGATAT 
CATGCAGAGGCACTGTGGAGAGTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNWNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
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Figure 29. 



NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNTSINNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNTGGCATAGACTTCTGCTTATTGAAGGAGGATGCCAATCCC 
TTCATTTCTCAGCTTGAACTGAGGCCATTACCTGAAGAATACATGCATGATTTCTCTACC 
AGCG T T T TAAAACT GAT AAAC AGAAAT AAT C T TT GT GGCAT AGAAGACG AC AT CAGGT T C 
CCTGTTGACCAAAATGATAGAATCTGGAAAGCAACTTCAAGTCCATCATATGCTGTTCCA 
CTGTCTTTCAATGTCAGTGATGTTGATCTCAACGGCAAAGTGACACCTCCTCGACAAGTC 
CTACAAACAGCTCTTACTCACCCTGATCGATTGGTGTTCGTCCACGACGGCCTCGAGACC 
GATGATTATGAATACTCTGTGTTACTCTACTTTCTTGAACTAAATAACACTGTCAAAGCA 
G G AC AAAGG G T G T T T GAG AT C T AT C T AAAC AG T GAG AT T AAAAAG GAG AG TTTTGATGTA 
T C GG AAG GAG GAT C C AAG TAG AG T T AC AT C ACC T T G AAC AT T T C AG C AAAT G G A T C AC T C 
AATATAACCTTAGCCAAGGCATCTGGATCTAAGTTTGGACCCCTTTTGAATGCTTATGAA 
ATCCTGCAGGCGCGACCATGGATCGATGAGACCGACCAAACAGATGTGGAAGTTATTCAG 
AAG T T G AGAAAAGAAC TAG T G C T G C AAAAC C AAG AC AAT GAAG CAT T GG AG AG T T G GAG T 
GGAGATCCTTGTATGCTGTTTCCCTGGAAAGGAGTAGCATGCGATGGTTCAAATGGTTCG 
TCTGTCATCACTAAGCTGGATCTTTCCTCCAGTAATCTCAAAGGAACAATCCCTTCCAGT 
GTCACTGAGATGACCAAATTACAAATACTGAACCTGAGCCACAACCACTTCGATGGCTAT 
ATCCCCTCGTTTCCATCGTCTTCCTTGTTGATATCAGTAGATCTGAGCTACAATGACCTA 
ACGGGACAGCTTCCAGAATCCATTATCTCACTGCCACATTTAAACTCATTGTATTTTGGT 
T GC AAT CAAC AC AT GAG C G AC GAT GAT GAAG C C AAAT T GAAC AG T T C AC T AAT C AG TAG A 
GATTATGGGAGATGCAAGGCAAAAAGCCCCAAATTTGGACAAGTATTCATGATTGGAGCA 
ATTACAAGTGGATCGATTTTGATTACTTTGGCTGTTGGAATTCTATTTTTTTGCCGTTAT 
AG AC ACAAG T CAAT TACT T T GG AAGGAT T T G G T GGAAAG ACC T AC CCG AT G GCAAC AAAT 
ATAATTTTCTCCTTGCCAAGCAAAGACGATTTCTTCATAAAGTCTGTATCAGTTAAACCG 
TTCTCTTTG GAG TAT AT AG AG T T G G CAAC AG AG AAG T AC AAAAC T T T GAT AG G T GAAG G A 
GGGTTTGGCTCTGTTTACCGAGGCACTCTAGACGATGGTCAAGAAGTGGCAGTGAAAGTC 
C G G T CAGCC ACAT CAAC T C AG G GAAC C CGAG AAT T T GACAAT GAG C T AAAC C T AC T T T C A 
G C TAT AC AAC AT GAG AAC CTGGTGCCTCTTCTTGGT T AC T GT AAT G AAT AT GAT C AAC AA 
ATTCTTGTGTACCCTTTCATGTCTAATGGATCTTTGCTAGATAGACTATATGGGGAACCA 
G C AAAG AG AAAAAT AC TAG AC T GG C CAAC TAG AC T C T C TAT T GC T C T AG GAG C AG C T C G A 
GGTTTGGCATATCTTCACACATTTCCTGGACGGTCTGTAATACACAGGGACGTAAAATCG 
AGCAATATACTACTGGACCATAGCATGTGTGCTAAGGTCGCAGATTTTGGTTTCTCAAAA 
TACGCTCCTCAGGAAGGAGACAGTTATGTTTCCCTTGAAGTAAGAGGAACTGCAGGGTAT 
CTGGATCCCGAGTACTACAAAACCCAGCAATTATCTGAAAAGAGTGATGTTTTTAGTTTT 
GGCGTGGTTCTACTAGAAATTGTAAGTGGCCGGGAACCTCTCAACATAAAGAGACCACGA 
G T C GAG T G G AG C T T G G T T G AAT G G G C T AAAC CAT AC AT AAG AG CAT C AAAG G T G G AT G AA 
ATTGTAGATCCTGGCATCAAGGGAGGATATCATGCAGAGGCACTGTGGAGAGTGGTGGAA 
GTAGCACTGCAATGTCTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
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Figure 30. 
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Figure 31. 
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Figure 32. 
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Table 1. 



Name of 
the F2 
families 


Number of 

the 
F2 seeds 


Number of 

the 
germinated 
F2 seeds 


Number of 
the died 
F2 plants 


Number of 
the Nod" 
F2 plants 


Number of 
the Nod + 
F2 plants 


Nod" : Nod + 
ratio 


NAB 


5571 


4988 


2412 


50 


2526 


1 : 50 


NBW 


1285 


1199 


460 


13 


726 


1 : 55 
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Table 5. (1/4) 
50E23 



50E23 



MA ME HC 

NAME Ur 


DCCTDIPTIAM 

Kto 1 KIU 1 IUN 


insGn size 


Int tLUIMt 


runout ir«i caoc 

ENDONUCLEASE 


(Kb) 


blorl 


UmA III 

Hind III 


i,y 




Utnrl III 

nina til 


0,0 


G18P9 


Umrl III 

nine) ill 


3,6 


G18P17 


1 1:„ J ill 

Hind III 


4,0 


blor74 


UinH III 

nino III 


■i (\ 
1,0 


G18P31 


Xbal 




G18P32 


Xbal 


3,8 


G18P37 


Xbal 


2,4 


G18P65 


EcoRI 


1,0 


G18P71 


EcoRI 


2,8 


G18P75 


BamHI 


1,4 



M AMP OP 


Kto 1 KIU 1 IUN 


insert size 


TUP PI DMP 
I tit ULUIMfc 




(Kb) 


G13P1 


random 


^ a r 

>0,5 


G13P2 


random 


>0,5 


a<i ^p\a 

G13P3 


random 


>0,7 


Aj< Al*\ i( 

G13P4 


random 


>0,8 
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random 


>0,7 


G13P6 


random 


>0,7 


G13P7 


random 


>1,0 


G13P8 


random 


1,5 


G13P9 


random 


1,3 


G13P10 


random 


0,9 


blorl 1 


random 


1,5 


a<< irM r< 

G13P16 


random 


0,8 


a^ aoa a 

G13P32 


random 


1,5 


G13P33 


random 


>0,4 


G13P41 


random 


1,6 


G13P42 


random 


1,3 


G13P45 


random 


1,0 


aj on^7 

G13P47 


random 


J A 

1,0 


G13P49 


random 


2,6 


G13P50 


random 


t A f ' 

>0,5 


G13P51 


random 


v. A A 

>0,6 


G13P53 


random 


1,3 


P^DGC 
bl^rOO 


random 


1,4 


G13P58 


random 


1,0 


G13P59 


random 


>1,0 


G13P60 


random 


_>0,_5 _____ 


G13P61 


random 


"1,3 


G13P62__ 


random 


1,1 


G13P64 " 


random 


1,3 



MA MP AC 

NAR/lt Ur 


DCOTDIPTIftM 
Kto 1 KIU HUM 


insert size 


TUP PI PlMC 


ENDONUCLEASE 


(Kb) 


AvI AHAP 

G13P65 


random 


1,5 


AH OhAA 

G13P66 


random 


j a 

1,6 


f\A APIA A 

G13P68 


random 


1,6 


G13P69 


random 


>0,5 


G13P70 


random 


1,4 


G13P71 


random 


1,4 


G13P72 


random 


1,4 


G13P73 


random 


1,5 


G13P74 


random 


>0,6 


G13P75 


random 


1,2 


G13P76 


random 


1,6 


G13P77 


random 


1,6 


Ait AmA 

G13P78 


random 


1,7 


G13P79 


random 


1,6 


G13P80 


random 


>0,6 


G13P81 


random 


>0,o 


G13P82 


random 


A A 

1,3 


Ai Ap\AA 

G13P83 


random 


A A 

1,2 


G13P84 


random 


v A A 

>0,1 


A J APtAP 

G13P85 


random 


1,2 


G13P86 


random 


>0,7 


G13P87 


random 


A A 

1,3 


bloroo 


M .Jam 

random 


>U,D 


G13P89 


random 


>0,8 


G13P92 


random 


>0,8 


G13P94 


random 


1,2 


G13P95 


random 


>0,3 


G13P96 


random 


>0,8 


G13P97 


random 


1,8 
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Table 5, (cont. 2/4) 



50E23 



50E23 



2D11 



NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G13P98 


random 


>0,6 


G13P99 


random 


1,4 


G13P100 


random 


0,2 


G13P101 


random 


0,5 


G13P104 


random 


1,5 


G13P105 


random 


6,0 


G13P106 


random 


1,4 


G13P107 


random 


0,7 


G13P108 


random 


6,1 


G13P110 


random 


0,8 


G13P113 


random 


0,1 


G13P114 


random 


0,3 


G13P115 


random 


0,1 


613P116 


random 


0,7 


G13P117 


random 


0,7 


G13P119 


random 


1,5 


G13P120 


random 


0,8 


G13P121 


random 


1,2 


G13P122 


random 


1,6 


G13P123 


random 


0,8 


G13P126 


random 


1,5 


G13P127 


random 


1,5 


G13P128 


random 


1,0 


G13P134 


random 


1,3 


G13P138 


random 


1,1 


G13P153 


random 


1,3 


G13P156 


random 


1,1 


G13P157 _ 


random 


1,2 


G13P158~ 


random 





NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G13P159 


random 


>0,5 


G13P161 


random 


1,3 


G13P165 


random 


>0,6 


G13P166 


random 


1,5 


G13P167 


random 


1,3 


G13P168 


random 


0,6 


G13P169 


random 


>0,9 


G14P1 


Cladelecid 


3,8 



NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G3P17 


random 


1,6 


G3P46 


random 


0,8 


G3P71 


random 


1,2 


G3P73 


random 


1,2 


G3P74 


random 


1,7 


G3P91 


random 


1,5 


G3P92 


random 


0,9 


G3P99 


random 


>1,7 


G3P105 


random 


1,7 


G3P107 


random 


2,3 


G3P123 


random 


1,3 


G3P126 


random 


1,5 


G3P136 


random 


1,0 


G3P139 


random 


1,3 


G3P147 


random 


1,1 


G3P168 


random 


1,4 


G3P178 


random 


1,5 


G3P207 


random 


0,9 


G3P222 


random 


1,1 


G3P224 


random 


1,2 


G3P239 


random 


1,1 


G3P245 


random 


" 0,6 


G3P268 


random 


0,8 


G3P287 


random 


0,3 


G3P289 


random 


0,9 


G3P301 


random 


0,9 


G3P411 


random 


1,1 


G3P413 


random 


_.._M_ . 


G3P477 " 


random 


0,9 



01 

10 
-> 
0) 

o 



2D11 


NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G3P483 


random 


>1,7 


G3P499 


random 


0,9 


G3P512 


random 


>0,5 


G3P514 


random 


>0,7 


G3P555 


random 


0,7 


G3P576 


random 


1,2 


G3P578 


random 


0,7 


G5P1 


BamHI 


1,5 


G5P4 


BamHI 


1,0 


G5P10 


BamHI 


3,1 


G5P14 


BamHI 


3,2 


G6P3 


Hind III 


1,7 


G6P4 


Hind III 


1,6 


G6P6 


Hind III 


2,7 


G6P11 


Hind III 


0,8 


G6P14 


Hind III 


1,7 


G6P15 


Hind III 


0,5 


G6P17 


Hind III 


1,8 


G6P18 


Hind III 


1,6 


G6P21 


Hind III 


0,3 


G6P26 


Hind III 


1,8 


G10P4 


Hind III 


2,7 


G10P8 


Hind III 


2,7? 


G10P25 


Hind III 


4,0 


G11P1 


Hind III 


3,3 


G11P9 


HindJN 


9,1? 


_G11P13 


Hind jll^ 


9,0 


_JS11P17__ 


-JMIL. 


6JL__ 


G11P20 


Hind III 


3,3 


" G11P24 


Hind III 


8,0 



Table 5. (cont. 3/4) 



2D11 



NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G11P34 


Hind III 


4,4 


G11P36 


Hind III 


7,4 


G11P37 


Hind III 


0,6 


G11P38 


Hind III 


4,7 


G11P45 


Hind III 


2,5 


G19P21 


Taq deletion 


0,9 


G19P27 


Taq deletion 


0,6 


G19P42 


Taq deletion 


0,1 


G19P46 


Taq deletion 


0,8 


G19P48 


Taq deletion 


0,8 


G19P73 


Taq deletion 


0,8 


G19P74 


Taq deletion 


0,7 


G19P81 


Taq deletion 


0,6 


G19P94 


Taq deletion 


0,8 



67A11 



NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G9P13 


random 


0,8 


G9P43 


random 


11 


G9P62 


random 


10 


G20P1 


Nhel 


09 


G20P2 


Nhel 

111 ICI 


96 


G20P4 


Nhel 

Ml Id 


65 


G20P5 


Nhel 

Ml ICI 


88 


G20P7 


Nhel 


4,4 


G20P14 


Nhel 


14 


G20P15 


Nhel 

111 Ivl 


56 

VjW 


G20P21 


Nhel 


0,4 


G20P31 


Nhel 


50 


G20P38 


Nhel 


17 


G20P56 


Nhel 


12,0 


G20P140 


Nhel 


5,5 


G21P25 


Xbal 


86 


G21P26 


Xbal 


42 


G21P34 


Xbal 


65 


G21P35 


Xbal 


22 


G21P43 


Xbal 


33 


G21P90 


Xbal 


2,0 


G21P136 


Xbal 


26 


G21P186 

vC 1 1 1 UU 


Xbal 

AUCU 


65 - 


G21PTIQ 


AUCll 


5 


G21P448 


Xbal 


3,5 


G22P2 


Pael-Taql del. 


3,8 


G22P5 


Pael-Taql del. 


__.„JA,_ 


G22P7 


Pael-Taql del. 


4,1 


G22P8 


Pael-Taql del. 


2,1 


G22P19 


Pael-Taql del. 


4,5 



67A11 



NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G23P4 


Sacl-Taql del. 


6,1 


G23P5 


Sacl-Taql del. 


3,3 


G23P17 


Sacl-Taql del. 


7,2 


G23P23 


Sacl-Taql del. 


2,6 


G24P245 


random 


1,0 


G24P413 


random 


0,9 


G24P418 


random 


1.1 


G24P442 


random 


1,1 


G29P10 


Hin2l 


1,5 


G29P23 


Hin2l 


0,7 


G29P96 


Hin2l 


4,5 


G29P117 


Hin2l 


0,8 


G29P141 


Hin2l 


0,9 


G30P193 


Hin6l 


6,3 


G31P81 


Tail 


5,5 


G32P68 


Taql 


0,4 


G32P90 


Taql 


0,3 


G32P96 


Taql 


1,8 


G32P112 


Taql 


2,7 


G32P126 


Taql 


5,0 


G32P194 


Taql 


0,5 


G32P286 


Taql 


3,3 


G33P16 


EcoRI 


6,0 


G33P48 


EcoRI 


2,3 


G33P63 


EcoRI 


5,0 


G33P64 


EcoRI 


3,0 


G33P121 


EcoRI 


4,5 


_G33_P15_5_ 


EcoRI 


1,6 


G34P20 


Hindlll 


1,4 


G34P30 


Hindlll 


7 


"(334P34 


"Hindi? 


"2,3" 



Table5.(cont 4/4) 
67A11 



NAME OF 


RESTRICTION 


Insert size 


THE CLONE 


ENDONUCLEASE 


(kb) 


G34P42 


Hindlll 


1,8 


G34P44 


Hindlll 


4,2 


G34P46 


Hindlll 


2,8 


G34P52 


Hindlll 


4,6 


G34P64 


Hindlll 


2,6 


G34P101 


Hindlll 


0,6 


G34P173 


Hindlll 


2,2 


G34P192 


Hindlll 


3,0 


G35P65 


SaulllAI 


0,7 


G35P85 


SaulllAI 


1,4 


G35P136 


SaulllAI 


1,3 


G50P4 


Kpnl deletion 


3,5 



I 

0 



i 
0 

o 

H 
O 

OC 
4- 
H 



n 

o 

O 

o 

71 
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Table 6. 



Structural 


First nucleotide 


Last nucleotide 


element 


Position 


EXON 1 


<897 


1248 


EXON 2 


1564 


2074 


EXON 3 


2798 


3273 


EXON 4 


3372 


3522 


EXON 5 


3800 


3871 


EXON 6 


3979 


4047 


EXON 7 


4519 


4590 


EXON 8 


4702 


4773 


EXON 9 


4860 


5032 


EXON 10 


5542 


5768 


EXON 11 


5912 


6037 


EXON 12 


6110 


6178 


EXON 13 


6797 


6986 


| EXON 14 


7340 


7472 


EXON 15 


7562 


8236< 




INTRON 1 


1249 


1563 


INTRON 2 


2075 


2797 


INTRON 3 


3274 


3371 


INTRON 4 


3523 


3799 


INTRON 5 


3872 


3978 


INTRON 6 


4048 


4518 


INTRON 7 


4591 


4701 


INTRON 8 


4774 


4859 


INTRON 9 


5033 


5541 


INTRON 10 


5769 


5911 


INTRON 11 


6038 


6109 


INTRON 12 


6179 


6796 


INTRON 13 


6987 


7339 


INTRON 14 


7473 


7561 
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Table 7. 



NAME OF 
THE 
CLONE 


CLONING 
VECTOR 


ENZYMES 


MODIFICATION 


NORK 
FRAGMENT 


ANTIBIOTIC 
RESISTANCE 
(pa/mh 


Name of the 
E. coli strain 


pBRC1667 


pAT680 


Kpn\ 


dephosphorilation 


8.8 kb Nhel 


kanamycin (200) 


BRC1667 


pBRC1678 


pPK459 


Kpn\ 


dephosphorilation 


8.8 kb Nhel 


tetracycline (10) 
kanamycin (50) 


BRC1678 


pBRC1666 


pPR97 


Cla\-Sac\ 


dephosphorilation 


8.8 kb Nhel 


kanamycin (200) 


BRC1666 


PBRC1701 


pPR97 


C/al 


dephosphorilation 


1 1 .8 kb Clal 


kanamycin (200) 


BRC1701 
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Table 8. 



Agrobacterium strain 


i TRANSFORMED 
PLASMID 


NAME OF THE STRAIN 

1 ■ # 1 1 ■ 1 l-a 1 III i— « I I \/ III 1 

CARRYING THE NORK 
GENE 


A, rhizogenes A4 


PBRC1667 


BRC1673 


A. rhizogenes 1 334 


pBRC1667 


BRC1675 


A. rhizogenes 1 724 


PBRC1667 


BRC1679 








A. tumefaciens LBA4404 


pBRC1667 


BRC1677 


A. tumefaciens LBA4404 


PBRC1678 


BRC1680 


A. tumefaciens LBA4404 


PBRC1666 


BRC1681 


A. tumefaciens LBA4404 


PBRC1701 


BRC1707 
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Table 9. 



NAME OF THE 

STRAINS 
CARRYING THE 
NORK GENE 


NUMBER OF THE 
PLANTS 
DEVELOPING 
ROOTS 


NUMBER OF THE 
GUS + ROOTS 


NUMBER OF THE 
NOD + PLANTS 


BRC1673 


78 (5) 


1 (0) 


0 


BRC1675 


87(2) 


3(0) 


0 
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Table 10. 



Plants to be 
transformed 


Number of shoots or 
leaves treated with 
Agrobacterium 


Number of 
regenerated 
plants 


Number of 
transformed 
plants 


Number of Nod + plants 


F1RN28 


157 


19 


2 


2 


F1RN41 


133 


14 






F2RN28/4 


120 








F2RN28/5 


75 
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Table 11. 



Plants to be 
transformed 


Number of 
shoots or 

leaves treated 
with 

Agrobacterium 


Total number 
of 

transformants 


Number of 
independent 
transformants/ 
Total number 
of 

transformants 


Presence of the M. 
truncatuia NORK 
sequence/Number of 
independent 
transformants 


Presence of the NORK 
protein/Number of 
independent 
transformants 


Nicotiana tabacum 


113 


15 


6/15 


6/6 


6/6 


Vicia/Lotus 


232 


26 


8/26 


8/8 


nt 
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